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The invention relates to the discovery of a novel RNA sequence at the 
3' terminal sequence of hepatitis C virus (HCV) genome RNA. Included in the 
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based diagnostics and for developing and evaluating novel anti-HCV therapies. 
This sequence element, which is conserved among HCV genotypes, is likely to 
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competent HCV repl icons. Such functional clones are useful tools for evaluation 
of therapeutic approaches and as substrates for developing candidate attenuated 
or inactivated HCV derivatives for vaccination against HCV. 
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NOVEL 3' TERMINAL SEQUENCE OF HEPATITIS C VIRUS GENOME 
AND DIAGNOSTIC AND THERAPEUTIC USES THEREOF 

GOVERNMENTAL SUPPORT 

This work was supported by National Cancer Institute/National Institutes of Health grant 
number CA57973. The government may have certain rights in this invention. 

TECHNICAL FIELD OF THE INVENTION 

The present invention relates generally to a novel nucleotide sequence element identified 
at or near the 3' terminus of the hepatitis C virus (HCV) viral genome RNA. This 
element is highly conserved among HCV genotypes and may be essential for HCV 
replication. 

BACKGROUND OF THE INVENTION 

After the development of diagnostic tests for hepatitis A virus and hepatitis B virus, an 
additional agent, which could be experimentally transmitted to chimpanzees (Alter et af, 
1978; Holiinger et al, 1978; Tabor et al, 1978), became recognized as the major cause 
of transfusion-acquired hepatitis. cDNA clones corresponding to the causative non-A 
non-B (NANB) hepatitis agent, called hepatitis C virus (HCV), were reported in 1989 
(Choo et al, 1989). This breakthrough has led to rapid advances in diagnostics, and in 
our understanding of the epidemiology, pathogenesis and molecular virology of HCV 
(see Houghton et al, 1994 for review). Evidence of HCV infection is found throughout 
the world and the prevalence of anti-HCV antibodies ranges from 0.4-2% in most 
developed countries to more than 14% in Egypt (Hibbs et al, 1993). Besides 
transmission via blood or blood products, or less frequently by sexual and congenital 
routes, sporadic cases, not associated with known risk factors, occur and account for 
more than 40% of HCV cases (Alter et al, 1990; Mast and Alter, 1993). Infections are 
usually chronic (Alter et al, 1992) and clinical outcomes range from an inapparent 
carrier state to acute hepatitis, chronic active hepatitis, and cirrhosis which. is strongly 
associated with the development of hepatocellular carcinoma. Although alpha IFN has 
been shown to be useful for the treatment of some patients with chronic HCV infections 
(Davis et al, 1989; DiBisceglie et al, 1989) and subunit vaccines show some promise in 
the chimpanzee model (Choo et al, 1994), future efforts are needed to develop more 
effective therapies and vaccines. The considerable diversity observed among different 
HCV isolates (for review, see Bukh et al, 1995), the emergence of genetic variants in 
chronically infected individuals (Enomoto et al, 1993; Hijikata et al, 1991; Kato et al. 
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1992; Kato et al, 1993; Kurosaki et al, 1993; Lesniewski et at, 1993; Ogata et ai, 
1991; Weiner et al, 1991; Weiner et al, 1992), and the lack of protective immunity 

elicited after HCV infection (Farci et al, 1992; Prince et ai, 1992) present major 

challenges towards these goals. 

Molecular biology of HCV 

Classification. Based on its genome structure and virion properties, HCV has 
been classified as a separate genus in the flavivirus family, which includes two other 
genera: the flaviviruses Isuch as yellow fever virus <YF)] and the animal pesti viruses 
[bovine viral diarrhea virus (BVDV) and classical swine fever virus (CSFV)] (Francki et al, 
1991). All members of this family have enveloped virions that contain a positive-strand 
RNA genome encoding all known virus-specific proteins via translation of a single long 
open reading frame (ORF; see below). 

Structure and physical properties of the virion. Little information is available on 
the structure and replication of HCV. Studies have been hampered by the lack of a cell 
culture system able to support efficient virus replication and the typically low titers of 
infectious virus present in serum. The size of infectious virus, based on filtration 
experiments, is between 30-80 nm (Bradley et al, 1985; He et al, 1987; Yuasa et al, 
1991). HCV particles isolated from pooled human plasma (Takahashi et al, 1992), 
present in hepatocytes from infected chimpanzees, and produced in cell culture (Shimizu 
et al, 1994a) have been visualized (tentatively) "by electron microscopy. Initial 
measurements of the buoyant density of infectious material in sucrose yielded a range 
of values, with the majority present in a low density pool of < 1.1 g/ml (Bradley et al, 
1991). Subsequent studies have used RT/PCR to detect HCV-specific RNA as an 
indirect measure of potentially infectious virus present in sera from chronically infected 
humans or experimentally infected chimpanzees. From these studies, it has become 
increasingly clear that considerable heterogeneity exists between different clinical 
samples, and that many factors can affect the behavior of particles containing HCV 
RNA (Hijikata et al, 1993; Thomssen et al, 1992). Such factors include association 
with immunoglobulins (Hijikata et al, 1 993) or low density lipoprotein (Thomssen et al, 
1992; Thomssen et ai, 1993). In highly infectious acute phase chimpanzee serum, 
HCV-specific RNA is usually detected in fractions of low buoyant density (1.03-1.1 
g/ml) (Carrick et al, 1992; Hijikata et al, 1993). In other samples, the presence of HCV 
antibodies and formation of immune complexes correlate with particles of higher density 
and lower infectivity (Hijikata et al, 1993). Treatment of particles with chloroform, 
which inactivates infectivity (Bradley et al, 1983; Feinstone et al, 1983), or with 
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nonionic detergents, produces RNA containing particles of higher density (1.17-1.25 
g/ml) believed to represent HCV nucleocapsids (Hijikata et al, 1993; Kanto et ai, 1994; 
Miyamoto et al, 1992). 

5 There have been many reports of varying levels of negative-sense HCV-specific RNAs in 
sera and plasma (see Fong et al, 1991). However, it seems unlikely that such RNAs are 
essential components of infectious particles since some sera with high infectivity can 
have low or undetectable levels of negative-strand RNA (Shimizu et al, 1993). The 
virion protein composition has not been rigorously determined, but putative HCV 
10 structural proteins include a basic C protein and two membrane glycoproteins, E1 and 
E2. 

HCV replication. Early events in HCV replication are poorly understood. Cellular 
receptors for the HCV glycoproteins have not been identified. The association of some 

1 5 HCV particles with beta-lipoprotein and immunoglobulins raises the possibility that these 
host molecules may modulate virus uptake and tissue tropism. Studies examining HCV 
replication have been largely restricted to human patients or experimentally inoculated 
chimpanzees. In the chimpanzee model, HCV RNA is detected in the serum as early as 
3 days post-inoculation and persists through the peak of serum alanine aminotransferase 

20 (ALT) levels (an indicator of liver damage) (Shimizu et al, 1990). The onset of viremia is 
followed by the appearance of indirect hallmarks of HCV infection of the liver. These 
include the appearance of a cytoplasmic antigen (Shimizu et al, 1990) and 
ultrastructural changes in hepatocytes such as the formation of microtubular aggregates 
for which HCV previously was referred to as the chloroform-sensitive "tubule forming 

25 agent" or TFA" (reviewed by Bradley, 1990). As shown by the appearance of viral 

antigens (Blight et al, 1993; Hiramatsu et al, 1992; Krawczynski et al, 1992; Yamada et 
al, 1993) and the detection of positive and negative sense RNAs (Fong et al, 1991; 
Gunji et al, 1994; Haruna et al, 1993; Lamas et al, 1992; Nouri Aria et al, 1993; 
Sherker et al, 1993; Takehara et al, 1992; Tanaka et al, 1993), hepatocytes appear to 

30 be a major site of HCV replication, particularly during acute infection (Negro et al, 

1992). In later stages of HCV infection the appearance of HCV-specific antibodies, the 
persistence or resolution of viremia, and the severity of liver disease, vary greatly both 
in the chimpanzee model and in human patients. Although some liver damage may 
occur as a direct consequence of HCV infection and cytopathogenicity, the emerging 

35 consensus is that host immune responses, in particular virus-specific cytotoxic T 

lymphocytes, may play a more dominant role in mediating cellular damage (see Rice 
and Walker, 1995 for review). 
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It has been speculated that HCV may also replicate in extra hepatic reservoir(s), 
particularly in chronically infected individuals. In some cases, RT7PCR or in situ 
hybridization has shown an association of HCV RNA with peripheral blood mononuclear 
cells including T-cells, B-cells, and monocytes (Blight et ai f 1992; Bouffard et a!, 1992; 
Gil et al, 1993; Gunji et al, 1994; Moldvay et at, 1994; Nuovo et al, 1993; Wang et al, 
1992; Young et al, 1993; Yun et al. 1993; Zignego et al, 1992). Such tissue tropism 
could be relevant to the establishment of chronic infections and might also play a role in 
the association between HCV infection and certain immunological abnormalities such as 
mixed cryoglobulinemia (reviewed by Ferri et ai, 1993), glomerulonephritis, and rare 
non-Hodgkin's B-lymphomas (Ferri et al, 1993; Kagawa et al, 1993). However, the 
detection of circulating negative strand RNA in serum, the difficulty in obtaining truly 
strand-specific RT/PCR {Gunji et al, 1994), and the low numbers of apparently infected 
cells have made it difficult to obtain unambiguous evidence for replication in these 
tissues in vivo. 

Although a cell culture system capable of efficient HCV replication has not been 
developed, some progress has been made. Consistent with the in vivo observations 
mentioned above, in vitro HCV infection and short term replication have been reported 
for chimpanzee and human hepatocytes (Carloni et al f 1993; lacovacci et al, 1993; 
Lanford et al, 1994), a human hepatoma line (Huh7; Yoo et al, 1995, see below), 
peripheral blood leukocytes (Muller et al, 1993), a human B-cell line expressing EBV 
antigens (Bertolini et al, 1993), a mouse retrovirus-infected human T-cell line (Molt4-Ma; 
Shimizu et al, 1992), an HTLV-1 transformed human T-celi line (MT-2; Kato et al, 
1995), and fibroblasts derived from human foreskin (Zibert et al, 1995). Thus far, only 
a small fraction of these cells appear infected, in vitro infectivity of different HCV 
inocula using a permissive subclone of the Molt4-Ma T-cell line correlates well with their 
in vivo infectivity in the chimpanzee model (Shimizu et al, 1993). This cell line has also 
been used to begin examining HCV binding and the possible emergence of neutralization 
escape mutants during chronic infection (Shimizu et al, 1994b). 

Genome structure. Full-length or nearly full-length genome sequences of 
numerous HCV isolates have been reported (see Lin et al, 1994; Okamoto et al, 1994; 
Sakamoto et al, 1994 and citations therein). Given the considerable genetic divergence 
among isolates, it is clear that several major HCV genotypes are distributed throughout 
the world (see below). Those of greatest importance in the U.S. are genotype 1, 
subtypes la and 1b. HCV genome RNAs are about 9.4 kilobases in length. The 5' NTR 
is 341-344 bases and is the most conserved RNA sequence element in the HCV 



WO 97/08310 



PCT/US96/14033 



5 

genome. The length of the long ORF varies slightly among isolates, encoding 
polyproteins of 3010, 3011 or 3033 amino acids. The reported 3' NTR structures show 
considerable diversity both in composition and length (28-42 bases), and appear to 
terminate with poly (U) (for examples, see Chen et al, 1992; Okamoto et ai, 1991; 
Tokita et al, 1994) except in one case (HCV-1, type 1a) which appears contain a 3' 
terminal poly (A) tract (Han et al, 1991). 

Translation and proteolytic processing. Several studies have used cell-free 
translation and transient expression in cell culture to examine the role of the 5' NTR in 
translation initiation (Fukushi et al, 1994; Tsukiyama-Kohara et al, 1992; Wang et al, 
1993; Yoo et al, 1992). This highly conserved sequence contains multiple short 
AUG-initiated ORFs and shows significant homology with the 5' NTR region of 
pestiviruses (Bukh et al, 1992; Han et al, 1991). A series of stem-loop structures have 
been proposed on the basis of computer modeling and sensitivity to digestion by 
different ribonucleases (Brown et a!, 1992; Tsukiyama-Kohara et al, 1992). Although 
still controversial (see Wang et al, 1993; Yoo et al, 1992), the results from several 
groups indicate that this element functions as an internal ribosome entry site (IRES) 
allowing efficient translation initiation at the first AUG of the long ORF (Fukushi et al, 
1994; Tsukiyama-Kohara et al, 1992; Wang et al, 1993). Some of the predicted 
features of the HCV and pestivirus IRES elements are similar to one another (Brown et 
al, 1992). It has been proposed that the 5' terminal hairpin structure and the short 
ORFs may function to downregulate translation (Yoo et al, 1992). The ability of this 
element to function as an IRES suggests that HCV genome RNAs may lack a 5' cap 
structure. 

The organization and processing of the HCV poiyprotein appears to be most similar to 
that of the pestiviruses. At least 10 polypeptides have been identified and the order of 
these cleavage products in the poiyprotein is NH 2 -C-E1-E2-p7-NS2-NS3-NS4A-NS4B- 
NS5A-NS5B-COOH. Proteolytic processing is mediated by host signal peptidase and 
two HCV-encoded proteinases, the NS2-3 autoproteinase and the NS3-4A serine 
proteinase. C is a basic protein believed to be the viral core or capsid protein; E1 and 
E2 are putative virion envelope glycoproteins; p7 is a hydrophobic protein of unknown 
function that is inefficiently cleaved from the E2 glycoprotein (Lin et al, 1994; 
Mizushima et al, 1994; Selby et al, 1994), and NS2-NS5B are likely nonstructural (NS) 
proteins which function in viral RNA replication complexes. In particular, besides its 
N-terminal serine proteinase domain, NS3 contains motifs characteristic of RNA 
helicases and has been shown to possess an RNA-stimulated NTPase activity (Suzich et 
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al, 1993); NS5B contains the GOD motif characteristic of the RNA-dependent RNA 
polymerases of positive-strand RNA viruses. 



Virion assembly and release. This process has not been examined directly, but 
5 the lack of complex glycans, the ER localization of expressed HCV glycoproteins 

(Dubuisson et al, 1 994; Ralston et al, 1 993) and the absence of these proteins on the 
cell surface (Dubuisson et al, 1 994; Spaete et al, 1 992) suggest that initial virion 
morphogenesis may occur by budding into intracellular vesicles. Thus far, efficient 
particle formation and release has not been observed in transient expression assays, 

10 suggesting that essential viral or host factors are missing or blocked. HCV virion 
formation and release may be inefficient, with a substantial fraction of the virus 
remaining cell-associated, as found for the pestiviruses. A recent study indicates that 
extracellular HCV particles partially purified from human plasma do contain complex 
N-linked glycans, although these carbohydrate moieties were not shown to be 

15 specifically associated with E1 or E2 (Sato et al, 1993). Complex glycans associated 
with glycoproteins on released virions would suggest transit through the trans Goigi and 
movement of virions through the host secretory pathway. If this suggestion is correct, 
intracellular sequestration of HCV glycoproteins and virion formation might then play a 
role in the establishment of chronic infections by minimizing immune surveillance and 

20 preventing lysis of virusnnfected cells via antibody and complement. 



Genetic variability. As for all positive-strand RNA viruses, the RNA-dependent 
RNA polymerase of HCV (NS5B) is believed to lack a 3'-5'exonuciease proof reading 
activity for removal of (Disincorporated bases. Replication is therefore error-prone 

25 leading to a "quasispecies" virus population consisting of a large number of variants 

(Martell et al, 1992; M artel I et al, 1994). This variability is apparent at multiple levels. 
First, in a chronically infected individual changes in the virus population occur over time 
(Ogata et al, 1991; Okamoto et al, 1992) and these changes may have important 
consequences for disease. A particularly interesting example is the N-terminal 30 

30 residues of the E2 glycoprotein which exhibits a much higher degree of variability than 
the rest of the polyprotein ' f or examples, see Higashi et al, 1993; Hijikata et al, 1991; 
Weiner et al, 1991). There i accumulating evidence that this hypervariable region, 
perhaps analogous to the VJ domain of HIV-1 gp120, may be under immune selection 
by circulating antiviral antibodies (Kato et al, 1993; Taniguchi et at, 1993; Weiner et al, 

35 1992). In this model, antibodies directed against this portion of E2 may contribute to 
virus neutralization and thus drive the selection of variants with substitutions which 
escape neutralization. This plasticity suggests that a specific amino acid sequence in 
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the E2 hypervariable region is not essential for other functions of the protein such as 
virion attachment, penetration, or assembly. Genetic variability may also contribute to 
the spectrum of different responses observed after treatment of chronically infected 
patients with alpha IFN. Diminished serum ALT levels and improved liver histology, 
5 which is sometimes correlated with a decrease in the level of circulating HCV RNA, is 
seen in only -40% of those treated (Greiser-Wilke et a!, 1991). After treatment, 
approximately 70% of the responders relapse. In some cases, after a transient loss of 
circulating viral RNA, renewed viremia is observed even during the course of treatment. 
While this might suggest the existence or generation of IFN-resistant HCV genotypes or 
0 variants, further work is needed to determine the relative contributions of virus 

genotype and host-specific differences in immune responsiveness. Finally, sequence 
comparisons of different HCV isolates around the world have uncovered enormous 
genetic diversity (reviewed in Ref. Bukh et al, 1995). Because biologically relevant 
serological assays such as cross-neutralization tests are lacking, HCV types (designated 
5 by numbers), subtypes (designated by letters), and isolates are currently being grouped 
on the basis of nucleotide or amino acid sequence similarity. Amino acid sequence 
similarity between the most divergent genotypes can be as little as - 50%, depending 
upon the protein being compared. This diversity is likely to have important biological 
implications, particularly for diagnostics, vaccine design, and therapy. As mentioned 
earlier, genotypes 1a and 1b are most common in the U.S. {see Bukh et al, 1995 for a 
discussion of 

genotype prevalence and distribution). Recently, in Yoo et al (1995) T7 transcripts from 
various derivatives of an HCV-1 cDNA clone were tested for their ability to replicate by 
transfection of the human hepatoma cell line, Huh7. Possible HCV replication was 
assessed by strand-specific RT/PCR (using 5' NTR primers) and metabolic labeling of 
HCV-specific RNAs with 3H-uridine. Transcripts terminating with either poly (A) or poly 
(U), were positive by these assays but those with a deletion of the 5' terminal 144 
bases were not. In some cultures, HCV-specific RNA could be detected in the culture 
media and could be used to reinfect fresh Huh7 cells. While these claims cannot be 
directly refuted, it seems likely that the authors are not actually detecting authentic 
HCV replication. For instance, the authors' positive control was productive transfection 
of Huh7 cells with RNA extracted from 1 ml of high HCV titer chimpanzee plasma. This 
extracted sample would contain a maximum of 107 potentially infectious full-length 
HCV RNA molecules. Under optimum transfection conditions (other than 
microinjection), >105 RNA molecules of virion RNA (at least for poiiovirus, Sindbis 
virus, or YF) are typically required to initiate a single infectious event. This suggests 
that in the HCV-1 experiment fewer than 100 cells would be productively transfected. 
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At 16 days post-transfection, both positive- and negative-strand RNAs were readily 
detected after 8 hours of metabolic labeling. The detection of negative-strand RNA by 
this method (both for transfected virion RNA and transcript RNA) suggests that HCV is 
capable of both efficient replication and spread, and that the level of HCV RNA 
5 synthesis is similar to that which would be expected for a more robust flavi virus, such 
as YF (at the peak of a high multiplicity infection). However, despite numerous 
attempts, the authors were unable to detect HCV antigens in these cells using a variety 
of antisera or full-length positive- or negative-strands by Northern analysis (which is 
much more sensitive than metabolic labeling with 3H-uridine) (J. Han, personal 

10 communication). To say the least, these results are perplexing and not easily reconciled 
with authentic HCV replication. Finally, the critical experiment, demonstrating that RNA 
or virus derived from the HCV-1 clone is infectious in the chimpanzee model, has not 
been reported (despite the initial presentation of this work at a meeting more than two 
years ago). Work in other RNA virus systems has shown that specific terminal 

1 5 sequences can be critical for the generation of functional, replication competent RNAs 
(reviewed in Boyer and Haenni, 1994). Such sequences are believed to be involved in 
initiation of negative- and positive-strand RNA synthesis. In some cases, a few 
additional bases, or even longer non-virai sequences, are tolerated at the 5' and 3' 
termini; these sequences are typically lost or selected against during authentic viral 

20 replication. For other RNA viruses, extra bases, particularly at the 5' terminus, are 

deleterious (Boyer and Haenni, 1994). In contrast, except in a few cases, transcripts 
lacking authentic terminal sequences are non-functional (Boyer and Haenni, 1994). For 
instance, deletion of the 3' terminal secondary structure or conserved sequence 
elements in the 3' NTR of flavivirus genome RNA is lethal for YF (P. J. Bredenbeek and 

25 C.M.R., unpublished) or TBE (C. Mandl, personal communication) RNA replication. 
Given the importance of these sequence elements for other viruses, it is clear that a 
more rigorous determination of the HCV terminal sequences needed to be made. 

SUMMARY OF THE INVENTION 

30 In view of the aforementioned deficiencies attendant with prior art HCV cDNA clones 
and cell culture systems for the analysis of HCV replication, and for the development of 
therapeutic compositions therefor, it is evident that there exists a need in the art for 
identification of particularly the 3' terminal sequence of HCV which can be incorporated 
into a full-length cDNA clone capable of yielding infectious RNA transcripts, which then 

35 can be used as target sequences for the production of attenuated HCV for vaccines, 
and which can be used as targets for therapeutic compositions. 
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In accordance with the present invention, nucleotide sequences derived from cloned 
cDNA are provided which encode an HCV 3' terminal RNA element. The newly 
discovered 3' element in particular is highly conserved among HCV genotypes, and is a 
general feature of the HCV RNA genome. 

The present invention includes a poly (UC) tract followed by a 101 nucleotide 3' 
terminal RNA sequence element, and to full-length HCV viral genome RNA or derived 
HCV RNA replicons containing the following sequence: 
5'- poly (UC) - AAUGGUGGCUCCAUCUUAGCCCUAGUCACGGCUA 
GCUGUGAAAGGUCCGUGAGCCGCAUGACUGCAGA 
GAGUGCUGAUACUGGCCUCUCUGCUGAUCAUGU -3' (SEQ ID NO:1). 

The present invention also relates to the complement of this RNA sequence and to full- 
length HCV negative-sense RNAs containing the following sequence: 
5' - 

ACAUGAUCAGCAGAGAGGCCAGUAUCAGCACUCUCUGCAGUCAUGCGGCUCACGGAC 
CUUUCACAGCUAGCCGUGACUAGGGCUAAGAUGGAGCCACCAUU-poly <GA> - 3' (SEQ 
ID NO:2). 

The present invention also relates to the DNA sequence corresponding to the 3' NTR 
element in the positive-sense HCV genome RNA: 
5' - poly - (TC) - 

AATGGTGGCTCCATCTTAGCCCTAGTCACGGCTAGCTGTGAAAGGTCCGTGAGCCGCAT 
GACTGCAGAGAGTGCTGATACTGGCCTCTCTGCTGATCATGT-3' (SEQ ID NO: 3). 

The present invention also relates to the DNA sequence corresponding to the 
complement of the 3' NTR element present in negative-sense HCV RNA: 
5' - 

ACATGATCAGCAGAGAGGCCAGTATCAGCACTCTCTGCAGTCATGCGGCTCACGGACC 
TTTCACAGCTAGCCGTGACTAGGGCTAAGATGGAGCCACCATT-poly <GA)-3' (SEQ ID 
N0:4). 

It should be appreciated, that although this sequence appears to "noncoding," it is 
possible that the sequence encodes a polypeptide of importance for HCV 
replication. There are two short open reading frames in the complement of the 3' 
terminal element which would be at the 5' of the negative-sense RNA followed by poly 
(A). These sequences could be expressed via translation of the 
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negative strand RNA {either a full-length negative-strand or a subgenomic 
RNA). 

In a further embodiment, this element, which extends beyond the previously accepted 
homopolymer tracts of poly(U) or poly (A), can be used to assemble full-length HCV 
cDNA clones for HCV-H and other HCV isolates (genotypes, types and subtypes) (HCV- 
1 acc.# M62321; HC-J1 acc.# D10749; HC-J acc.# D90208; HCV-BK acc.# M58335; 
HCV-H acc.# M67463; HC-J6 acc.# D00944; HC-J8 acc.# D01221; HC-J483 acc.# 
D13558; HC-J491 acc.# D10750; HC-C2 acc.# D10934; HCV-JK acc.# X61596; HCV- 
N acc.# S62220; HCV-T acc.# M84745; HCV-JT acc.# D01 171; HCV-JT acc.# 
D01172; HC-G9 acc.# D14853; HCV-K3a acc.# D28917; NZL1 acc.# D17763; HCV-Tr 
acc.# D26556). Such full-length viruses or derived HCV RNA replicons are useful for 
the study of HCV replication and virus-host interactions as well as for the development 
of screening assays for HCV therapeutics and the evaluation of therapeutic compounds. 

In a still further embodiment, live-attenuated strains of HCV are provided for use as 
vaccines. 



The present invention also relates to a recombinant DNA or RNA molecule or a 
degenerate variant thereof, which encodes the 3' HCV terminal sequence element; 
preferably the nucleic acid contains a sequence with substantially the same nucleotide 
sequence as SEQ ID NOS:1-4 or that shown in FIGURE 3 (SEQ ID NOS:20-24; the parts 
of sequences downstream of the poly (U) tract only), FIGURE 6 (SEQ ID NOS:28-31) 
and FIGURE 8 (SEQ ID NOS:33-36) and sequences forming a secondary structure similar 
to that of Figure 4 (SEQ ID NO:25), or alternatively, a structure which may be formed 
via interaction of the 3' terminal sequence element (or its compliment) with other HCV 
RNA sequences (e.g., 5' terminal sequences or other sequences at or near the 3 # NTR). 

The sequences of the HCV of the present invention or portions thereof, may be 
prepared as probes (or primers for RT-PCR) to screen for complementary sequences and 
related clones in the same or alternate species. The present invention extends to 
probes or primers so prepared that may be provided for screening cDNA libraries, 
plasma or infected cells for HCV. For example, the probes may be prepared 
synthetically and by recombinant DNA with a variety of known vectors, such as the 
phage, plasmid or viral vector. The present invention also includes the preparation of 
plasmids including such vectors, and the use of the DNA/RNA sequences to construct 
vectors expressing antisense RNA or ribozymes which would attack natural or 
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engineered HCV RNAs containing any or all of the sequences set forth in FIGURES 3 
(SEQ ID NOS:20-24), FIGURE 6 (SEQ ID NOS:28-31) and FIGURE 8 (SEQ ID NO.S:33- 
36) or as above as SEQ ID N0S:1-4, derivatives of the sequences, or homologous 
sequences from other HCV types/subtypes. Correspondingly, the preparation of 
antisense RNA and ribozymes are included herein. 

The present invention also includes RNA molecules having the properties noted herein, 
and that display the sequences set forth and described above and selected from SEQ ID 
N0S:1-4, 20-24, 28-31 and 33-36. 

In a further embodiment of the invention, the full nucleotide sequence of the HCV 
containing the above determined sequences may be introduced into an appropriate host. 
The invention accordingly extends to host cells transfected or transformed with the 
cloned HCV sequences and/or RNA derived therefrom, and more particularly, replication 
competent and/or complete DNA/RNA sequences assembled using the sequences, or 
homologous derivatives set forth above. 

According to other preferred features of certain preferred embodiments of the present 
invention, a transiently transfected or stable cell line is provided to produce infectious 
HCV, and attenuated strains of HCV. 

The present invention naturally contemplates several means for preparation of the HCV 
sequences, including as illustrated herein known recombinant techniques, and the 
invention is accordingly intended to cover such synthetic preparations within its scope. 
The isolation of the sequences disclosed herein facilitates the reproduction of not only 
the nucleic acid sequences themselves, but also infectious HCV, and attenuated HCV by 
such recombinant techniques, and accordingly, the invention extends to the wild type 
and attenuated HCVs so prepared from the disclosed sequences, and to transiently 
transfected cells or stable cell lines expressing this sequence, replicating HCV RNA, 
and/or producing virus. 

The invention includes an assay system for screening of potential drugs effective to 
modulate replication of HCV in target cells by interrupting or potentiating the viral life 
cycle. Potentiation would be desirable where stocks of HCV were to be produced, for 
use in experimental as well as therapeutic regimes (i.e., vaccines). In one instance, the 
test drug could be administered to a cellular sample transfected with an infectious HCV, 
cDNA clone or replication-competent RNA, to determine its effect upon the replicative 
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activity of the HCV in the presence of any chemical sample (including DNA or RNA), or 
to the test drug, by comparison with a control. 

The assay system could more importantly be adapted to identify drugs or other entities 
that are capable of binding to the HCV RNA sequences or which bind essential factors 
interacting with these sequences, thereby inhibiting or potentiating replication. Such 
assays would be useful in the development of drugs that would be specific against a 
wide range of HCV isolates, due to the conservation of an important 3' terminal 
sequence motif, identified by SEQ ID NOS:1-4. 

In yet a further embodiment, the invention contemplates antagonists of the activity of 
HCV, in particular, an agent or molecule that inhibits viral replication or transcriptional 
activity in general. In a specific embodiment, the antagonist can be an oligonucleotide 
having the sequence (or its complement) of a portion of a 3' terminal domain of an 
HCV. Such oligonucleotides may be capable of disrupting strand synthesis required for 
viral replication, translation of HCV RNA into protein, or packaging of genome RNA into 
virus particles. 

The diagnostic utility of the present invention extends to the use of the present 3' 
terminal sequence in assays to screen for HCV infection. In particular, probes or PCR 
primers may be produced which are capable of detecting HCV infection in blood, or in 
infected cells. Such probes may be labelled with any detectable label. In the instance 
where a radioactive label, such as the isotopes 3 H, 14 C, 32 P, 35 S, 36 CI, 51 Cr, 57 Co, S8 Co, 
59 Fe, 9 °Y, 125 1, 131 1, and 186 Re are used, known currently available counting procedures 
may be utilized. In the instance where the label is an enzyme, detection may be 
accomplished by any of the presently utilized colorimetric, spectrophotometric, 
fluorospectrophotometric, amperometric or gasometric techniques known in the art. 

The present invention includes an assay system which may be prepared in the form of a 
test kit for the quantitative analysis of the extent of the presence of the HCV 
sequences, or to identify drugs or other agents that may mimic or block the activity of 
such sequences. The system or test kit may comprise a labeled component prepared by 
one of the radioactive and/or enzymatic techniques discussed herein, coupling a label to 
a probe for HCV nucleic acid, or a binding partner thereof, or a binding partner of the 
HCV virion itself, and one or more additional immunochemical reagents, at least one of 
which is a free or immobilized iigand, capable either of binding with the labeled 
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component, its binding partner, one of the components to be determined or their binding 
partner(s). 

In a further embodiment, the present invention relates to certain therapeutic methods 
5 which would be based upon the activity of the HCV sequences(s), or active fragments 
thereof, or upon agents or other drugs determined to possess the same activity. A first 
therapeutic method is associated with the prevention of infection by HCV, in particular 
by providing a vaccine composed of an attenuated HCV, designed by mutating the 
sequence elements disclosed herein. 

10 

More specifically, the therapeutic method generally referred to herein could include the 
method for the treatment of hepatitis or other cellular dysfunctions caused by HCV by 
the administration of pharmaceutical compositions that may comprise effective 
inhibitors of the HCV or its subunits, or other equally effective drugs developed for 
15 instance by a drug screening assay prepared and used in accordance with a further 
aspect of the present invention. For example, drugs or other binding partners to the 
HCV nucleic acid or its encoded proteins, may be administered to inhibit or potentiate 
transcriptional activity. 

20 In particular, HSV or its herein-identified 3' sequence element or fragments thereof,. and 
binding partners thereto could be prepared in pharmaceutical formulations for 
administration in instances wherein interferon therapy is appropriate, such as to treat 
chronic viral hepatitis or other HCV-associated illnesses. 

25 Accordingly, it is a principal object of the present invention to provide a novel 3' 

sequence element of HCV, as well as full-length HCV genomes which encode wild-type 
or attenuated HCV bearing this additional sequence. 

It is a further object of the present invention to provide a method for detecting the 
30 presence of the HCV in mammals in which HCV is suspected to be present. 

It is a further object of the present invention to provide a method and associated assay 
system for screening substances such as drugs, agents and the like, potentially 
effective in combating the adverse effects of the HCV in mammals. 



WO 97/08310 



PCT/US96/14033 



14 

It is a still further object of the present invention to provide a method for the treatment 
of mammals to control the amount or activity of the HCV or fragments thereof, so as to 
alter the adverse consequences of such presence or activity. 

It is a still further object of the present invention to provide a method for the treatment 
of mammals to control the amount or activity of the HCV or its subunits, so as to treat 
or avert the adverse consequences of a pathological state. 

It is a still further object of the present invention to provide pharmaceutical 
compositions for use in therapeutic methods which comprise or are based upon the 
HCV, its sequence elements, their binding partner(s), or upon agents or drugs that 
control the production, or that mimic or antagonize the activities of the HCV. 

Other objects and advantages will become apparent to those skilled in the art from a 
review of the ensuing description which proceeds with reference to the following 
illustrative drawings. 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIGURE 1 depicts a method for determining the 3' terminal sequence of HCV. A 
phosphorylated synthetic oligodeoxynucleotide {"oligo A\* 5'- 

GACTGTTGTGGCCTGCAGGGCCGAATT-3'; SEQ ID NO:5) was ligated to the 3' end of 
the RNA to serve as a specific priming site for cDNA synthesis. A primer for cDNA 
synthesis ("oligo B w ; in 5-TTGAATTCGACCCTGCAGGCCACAACA-3'; SEQ ID NO:6 or 
B{2); 5'-TTGAATTCGGCCCTGCAGGCCACAACAGTC-3'; SEQ ID NO:7) was 
complementary to that used for ligation to the RNA; a second positive-sense primer for 
PCR corresponded to a sequence near the 3' end of the HCV ORF ("oligo C"; 5'- 
CAAGTCGACGGGGAGACATTTATCACAGC-3'; SEQ ID NO:8). 

FIGURE 2 depicts an alignment comparing the portion of the 3' terminal sequence (DNA 
corresponding to the positive sense HCV genome RNA) of HCV-H-AAK [SEQ ID NO:11; 
determined by sequencing of an uncloned DNA fragment which was synthesized by PCR 
using an oligo (dA) primer and "oligo CI between the termination codon of the ORF and 
the poly (U) tract with a partial list of published sequences for other HCV isolates 
(genotypes): HCV-H(1a> isolate (SEQ ID NO:9); HCV-l(la) (SEQ ID NO:10); HCV-J1(1a) 
(SEQ ID NO:12); HCV-BK(lb) (SEQ ID NO:13); HCV-TW(lb) (SEQ ID NO:14); HCV- 
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N(lb) (SEQ ID NO:15); HCV-J6(2a) (SEQ ID NO:16); HCV-J8 (SEQ ID NO:17); HCV- 
N2L1{3a) (SEQ ID N0:18); and HCV-Tr(3b) (SEQ ID N0:19). 

FIGURE 3 depicts the sequence (DNA corresponding to the positive sense HCV genome 
5 RNA) of HCV-H 3' clones including H77-#1,2 (SEQ ID NO:20>, H77-#8 (SEQ ID NO:21), 
H77-#10 (SEQ ID NO:22), H77-#74 (SEQ ID NO:23), H77-#5 (SEQ ID NO:24). 

FIGURE 4 depicts the computer predicted (FOLDRNA, GCG package) secondary 
structure of the 3' end of HCV-H (SEQ ID NO:25). The last 46 nucleotides form a 
10 stable stem-loop structure (predicted energy -25 kCal/mol). 

FIGURE 5 depicts the scheme used for the RT/PCR amplification and cloning of partial 3' 
terminal segments from four different HCV subtypes (1b r 3, 4 and 4a) (SEQ ID NOS:28- 
31). Briefly, oligo D (5'-TAACATGATCAGCAGAGAGGCCAG-3') (SEQ ID NO:26) was 
1 5 annealed to the 3' end of the genomic RNA, and cDNA was synthesized. Next, PCR 
was performed using oligo C (SEQ ID NO:8) and oligo E (5'- 

CTCACGGACCTTTCACAGC-3') (SEQ ID NO:27). The PCR products were cloned and 
sequenced. 

20 FIGURE 6 shows an alignment of sequences (5 '-3') (DNA sequence corresponding to the 
positive-sense HCV genome RNA) determined for 3 segments from HCV subtypes 1b, 3, 
4 and 4a amplified and cloned as described in Figure 5. The termination codon (TGA) is 
shown in bold. 

25 FIGURE 7 depicts a scheme for 3' end oligonucleotide ligation, RT/PCR and cloning of 
the HCV RNA from four different HCV genotypes/subtypes (1b, 3, 4 and 4a). Methods 
were essentially as outline in Figure 1, except that oligo F (5'- 

CC AAG AATTCCCTAGTC ACGGCTAG C- 3 ' ) (SEQ ID NO:32) was used instead of oligo C. 

30 FIGURE 8 shows the sequences (DNA sequence corresponding to the positive-sense 

HCV genome RNA) (SEQ ID NOS:33-36) resulting from the analysis described in FIGURE 
7. Isoiate-specific sequence differences are shown in bold. 

FIGURE 9 depicts a scheme for alternative 3' end primer ligation and RT/PCR for a 
35 genotype 4 isolate of HCV. A different oligonucleotide ("oligo G"; 5'- 

CGCACCCTGTCCGACTACAACATCC-3'; SEQ ID NO:37) was used for the RNA ligation 
step. Primers used for cDNA synthesis ("oligo H"; 
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CAGAATTCTTGTAGTCGGACAGGGTGCG-3'; SEQ ID NO:38) and for PCR ("oligo F" and 
"oligo H") are indicated. 

FIGURE 10 depicts the predicted structure of a full-length HCV genome RNA (precise 
numbers refer to HCV-H). The genome RNA probably initiates with a G residue and 
contains a 5' non-translated region (NTR) of 341 bases. The ORF consists of 9033 
bases encoding a polyprotein of 301 1 amino acid residues. Following the opal (UGA) 
stop codon is a sequence of 40 bases, a poly (U) tract, a polypyrimidine stretch, and a 
novel conserved RNA element of 101 bases. 

DETAILED DESCRIPTION 

In accordance with the present invention there may be employed conventional molecular 
biology, microbiology, and recombinant DNA techniques within the skill of the art. Such 
techniques are explained fully in the literature. See, e.g., Maniatis, Fritsch & Sambrook, 
"Molecular Cloning: A Laboratory Manual" (1989); "Current Protocols in Molecular 
Biology" Volumes Mil [Ausubel, R. M„ ed. (1994)]; "Cell Biology: A Laboratory 
Handbook" Volumes Mil [J. E. Celis, ed. (1994))]; "Current Protocols in Immunology" 
Volumes Mil [Coligan, J. E., ed. (1994)]; "Oligonucleotide Synthesis" (M.J. Gait ed. 
1984); "Nucleic Acid Hybridization" [B.D. Hames & S.J. Higgins eds. (1985)]; 
"Transcription And Translation" IB.D. Hames & S.J. Higgins, eds. (1984)]; "Animal Ceil 
Culture" [R.I. Freshney, ed. (1986)]; "Immobilized Cells And Enzymes" [IRL Press, 
(1986)]; B. Perbal, "A Practical Guide To Molecular Cloning" (1984). 

Therefore, if appearing herein, the following terms shall have the definitions set out 
below. 

The terms "3' terminal sequence element," "3' terminus," *3' sequence element," and 
any variants not specifically listed, may be used herein interchangeably, and as used 
throughout the present application and claims refer to nucleotide sequences having the 
sequence data described herein and presented in SEQ ID N0S:1-4 or that shown 
FIGURE 3 (SEQ ID NOS:20-24; the parts of sequences downstream of the poly (U) tract 
only), FIGURE 6 (SEQ ID NOS:28-31) and FIGURE 8 (SEQ ID NOS:33-36) and the profile 
of properties set forth herein and in the Claims. It should be appreciated that the terms 
"3' terminal sequence element," "3' terminus," "3' sequence element," are meant to 
encompass all of the following sequences: (i) an RNA sequence at the 3' terminus of 
the positive-sense genome RNA; (ii) the complement of this RNA sequence at the 5' 
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terminus of the HCV negative-sense RNA; (iii) the DNA sequence corresponding to the 
positive-sense sequence of the RNA element; and (iv) the DNA sequence corresponding 
to the negative-sense sequence of the RNA element. Examples of such sequences are 
illustrated in SEQ ID N0S:1-4, respectively. Accordingly, nucleotide sequences 
displaying substantially equivalent or altered properties are likewise contemplated. 
These modifications may be deliberate, for example, such as modifications obtained 
through site-directed mutagenesis, or may be accidental, such as those obtained 
through mutations in hosts that are producers of the complex or its named subunits. 
Also, the terms "3' terminal sequence/ "3' terminus," and "3' sequence element," are 
intended to include within their scope nucleic acid molecules specifically recited herein 
as well as all substantially homologous analogs and allelic variations. 

Any amino acid residues described herein are preferred to be in the "L" isomeric form. 
However, residues in the "D" isomeric form can be substituted for any L-amino acid 
residue, as long as the desired functional property of immunogiobulin-binding is retained 
by the polypeptide. NH 2 refers to the free amino group present at the amino terminus of 
a polypeptide. COOH refers to the free carboxy group present at the carboxy terminus 
of a polypeptide. In keeping with standard polypeptide nomenclature, J. Biol. Chem., 
243:3552-59 (1969), abbreviations for amino acid residues are shown in the following 
Table of Correspondence: 

TABLE OF CORRESPONDENCE 



SYMBOL 




AMINO ACID 




1 -Letter 


3-Letter 






Y 


Tyr 




tyrosine 


G 


Gly 




glycine 


F 


Phe 




phenylalanine 


M 


Met 




methionine 


A 


Ala 




alanine 


S 


Ser 




serine 


I 


lie 




isoleucine 


L 


Leu 




leucine 


T 


Thr 




threonine 


V 


Val 




valine 


P 


Pro 




proline 


K 


Lys 




lysine 


H 


His 




histidine 
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glutamine 
glutamic acid 
tryptophan 
arginine 
aspartic acid 
asparagine 
cysteine 

It should be noted that all amino-acid residue sequences are represented herein by 
formulae whose left and right orientation is in the conventional direction of amino- 
terminus to carboxy-terminus. Furthermore, it should be noted that a dash at the 
beginning or end of an amino acid residue sequence indicates a peptide bond to a 
further sequence of one or more amino-acid residues. The above Table is presented to 
correlate the three-letter and one-letter notations which may appear alternately herein. 

A "replicon" is any genetic element (e.g., plasmid, chromosome, virus) that functions as 
an autonomous unit of DNA or RNA replication in vivo; i.e., capable of replication under 
its own control. Bradenbeck and Rice (1992) Semin. Virol. 3:297-310 contains a 
description of RNA replicons. 

A "vector" is a replicon, such as a plasmid, phage or cosmid, to which another DNA (or 
RNA) segment may be attached so as to bring about the replication of the attached 
segment. 

A "DNA molecule" refers to the polymeric form of deoxyribonucleotides (adenine, 
guanine, thymine, or cytosine) in its either single stranded form, or a double-stranded 
helix. This term refers only to the primary and secondary structure of the molecule, and 
does not limit it to any particular tertiary forms. Thus, this term includes double- 
stranded DNA found, inter alia, in linear DNA molecules (e.g., restriction fragments), 
viruses, plasmids, and chromosomes. In discussing the structure of particular double- 
stranded DNA molecules, sequences may be described herein according to the normal 
convention of giving only the sequence in the 5' to 3' direction along the nontranscribed 
strand of DNA (i.e., the strand having a sequence homologous to the mRNA). 

An "RNA molecule" refers to the polymeric form of ribonucleotides (adenine, guanine, 
uridine, or cytosine) in its either single stranded for, or a double-stranded helix. This 
term refers only to the primary and secondary structure o the molecule, and does not 
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W 
R 
D 
N 
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Gin 
Glu 
Trp 
Arg 
Asp 
Asn 
Cys 
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limit it to any particular tertiary forms. Thus, this term includes single-stranded and 
double-stranded RNA found, inter alia, in linear or circular RNA molecules. In discussing 
the structure of particular RNA molecules, sequence may be described herein according 
to the normal convention of giving the sequence in the 5' to 3' direction. 

An "origin of replication" refers to those DNA sequences that participate in DNA 
synthesis. 

A "coding sequence" or "open reading frame" is a nucleotide sequence which is 
transcribed and translated into a polypeptide in vivo when placed under the control of 
appropriate regulatory sequences. The boundaries of the coding sequence are 
determined by a start codon at the 5' (amino) terminus and a translation stop codon at 
the 3' (carboxyl) terminus. A coding sequence can include, but is not limited to, 
prokaryotic sequences, cDNA from eukaryotic mRNA, genomic DNA sequences from 
eukaryotic (e.g., mammalian) DNA, and even synthetic DNA or RNA sequences. A 
polyadenylation signal and transcription termination sequence will usually be located 3' 
to the coding sequence. 

Transcriptional and transiational control sequences are DNA regulatory sequences, such 
as promoters, enhancers, polyadenylation signals, terminators, and the like, that provide 
for the expression of a coding sequence in a host cell. 

A "promoter sequence" is a DNA regulatory region capable of binding RNA polymerase 
in a cell and initiating transcription of a downstream (3' direction) coding sequence. For 
purposes of defining the present invention, the promoter sequence is bounded at its 3' 
terminus by the transcription initiation site and extends upstream (5' direction) to 
include the minimum number of bases or elements necessary to initiate transcription at 
levels detectable above background. Within the promoter sequence will be found a 
transcription initiation site (conveniently defined by mapping with nuclease SI), as well 
as protein binding domains (consensus sequences) responsible for the binding of RNA 
polymerase. Eukaryotic promoters will often, but not always, contain "TATA" boxes 
and "CAT" boxes. Prokaryotic promoters contain Shine-Dalgarno sequences in addition 
to the -10 and -35 consensus sequences. Promoter sequences can also be used to refer 
to analogous RNA sequences or structures of similar function in RNA virus replication 
and transcription. 
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An "expression control sequence" is a DNA sequence that controls and regulates the 
transcription and translation of another DNA sequence. A coding sequence is "under 
the control" of transcriptional and transiational control sequences in a cell when RNA 
polymerase transcribes the coding sequence into mRNA, which is then translated into 
the protein encoded by the coding sequence. RNA sequences can also serve as 
expression control sequences by virtue of their ability to modulate translation, RNA 
stability, and replication (for RNA viruses). 

A "signal sequence** can be included before the coding sequence. This sequence 
encodes a signal peptide, N-terminal to the polypeptide, that communicates to the host 
cell to direct the polypeptide to the cell surface or secrete the polypeptide into the 
media, and this signal peptide is clipped off by the host cell before the protein leaves 
the cell. Signal sequences can be found associated with a variety of proteins native to 
prokaryotes and eukaryotes. 

The term "oligonucleotide," as used herein in referring to the probe of the present 
invention, is defined as a molecule comprised of two or more ribonucleotides, preferably 
more than three. Its exact size will depend upon many factors which, in turn, depend 
upon the ultimate function and use of the oligonucleotide. 

The term "primer" as used herein refers to an oligonucleotide, whether occurring 
naturally as in a purified restriction digest or produced synthetically, which is capable of 
acting as a point of initiation of synthesis when placed under conditions in which 
synthesis of a primer extension product, which is complementary to a nucleic acid 
strand, is induced, i.e., in the presence of nucleotides and an inducing agent such as a 
DNA polymerase and at a suitable temperature and pH. The primer may be either 
single-stranded or double-stranded and must be sufficiently long to prime the synthesis 
of the desired extension product in the presence of the inducing agent. The exact 
length of the primer will depend upon many factors, including temperature, source of 
primer and use of the method. For example, for diagnostic applications, depending on 
the complexity of the target sequence, the oligonucleotide primer typically contains 1 5- 
25 or more nucleotides, although it may contain fewer nucleotides. 

The primers herein are selected to be "substantially" complementary to different strands 
of a particular target DNA sequence. This means that the primers must be sufficiently 
complementary to hybridize with their respective strands. Therefore, the primer 
sequence need not reflect the exact sequence of the template. For example, a non- 
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complementary nucleotide fragment may be attached to the 5' end of the primer, with 
the remainder of the primer sequence being complementary to the strand. Alternatively, 
non-complementary bases or longer sequences can be interspersed into the primer, 
provided that the primer sequence has sufficient complementarity with the sequence of 
5 the strand to hybridize therewith and thereby form the template for the synthesis of the 
extension product. 

As used herein, the terms "restriction endonucleases" and "restriction enzymes" refer to 
bacterial enzymes, each of which cut double-stranded DNA at or near a specific 
0 nucleotide sequence. 

A cell has been "transformed" by exogenous or heterologous DNA when such DNA has 
been introduced inside the cell. The transforming DNA may or may not be integrated 
(covalently linked) into chromosomal DNA making up the genome of the cell. In 
5 prokaryotes, yeast, and mammalian cells for example, the transforming DNA may be 
maintained on an episomal element such as a plasmid. With respect to eukaryotic cells, 
a stably transformed cell is one in which the transforming DNA has become integrated 
into a chromosome so that it is inherited by daughter cells through chromosome 
replication. This stability is demonstrated by the ability of the eukaryotic cell to 
establish cell lines or clones comprised of a population of daughter cells containing the 
transforming DNA. A "clone" is a population of cells derived from a single cell or 
common ancestor by mitosis. A "cell line" is a clone of a primary cell that is capable of 
stable growth in vitro for many generations. This definition can be applied to RNA * 
molecules which can be used to transform or "transfect" cells. For some RNA viruses, 
such methods can be used to produce infected cell lines which transiently or 
continuously support virus replication and, in some cases, which produce infectious viral 
particles. 



Two DNA or RNA sequences are "substantially homologous" when at least about 75% 
(preferably at least about 80%, and most preferably at least about 90 or 95%) of the 
nucleotides match over the defined length of the DNA sequences. Sequences that are 
substantially homologous can be identified by comparing the sequences using standard 
software available in sequence data banks, or in a Southern hybridization experiment 
under, for example, stringent conditions as defined for that particular system. Defining 
appropriate hybridization conditions is within the skill of the art. See, e.g., Maniatis et 
al, supra; DNA Cloning, Vols. I fit II, supra; Nucleic Acid Hybridization, supra. More 
distantly related sequences or structures, which may have the same or similar functions. 
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are referred to as "homologous." In the extreme case, such sequences could be 
unrelated in terms of linear sequence identity, but may have substantially similar 
secondary structure. 

A -heterologous" region of a DNA or RNA construct is an identifiable segment of DNA 
or RNA molecule within a larger nucleic acid that is not found in association with the 
larger molecule in nature. For instance, when the heterologous region encodes a 
mammalian gene, the gene will usually be flanked by DNA that does not flank the 
mammalian genomic DNA in the genome of the source organism. Another example of a 
heterologous coding sequence is a construct where the coding sequence itself is not 
found in nature (e.g., a cDNA where the genomic coding sequence contains introns, or 
synthetic sequences having codons different than the native gene). Allelic variations or 
naturally-occurring mutational events do not give rise to a heterologous region of DNA 
as defined herein. 

An "antibody" is any immunoglobulin, including antibodies and fragments thereof, that 
binds a specific epitope. The term encompasses polyclonal, monoclonal, and chimeric 
antibodies, the last mentioned described in further detail in U.S. Patent Nos. 4,816,397 
and 4,816,567. 

An "antibody combining site" is that structural portion of an antibody molecule 
comprised of heavy and light chain variable and hypervariable regions that specifically 
binds antigen. 

The phrase "antibody molecule" in its various grammatical forms as used herein 
contemplates both an intact immunoglobulin molecule and an immunologically active 
portion of an immunoglobulin molecule. 

Exemplary antibody molecules are intact immunoglobulin molecules, substantially intact 
immunoglobulin molecules and those portions of an immunoglobulin molecule that 
contains the paratope, including those portions known in the art as Fab, Fab', F(ab') 2 
and F(v) f which portions are preferred for use in the therapeutic methods described 
herein. 

Fab and F(ab') 2 portions of antibody molecules are prepared by the proteolytic reaction 
of papain and pepsin, respectively, on substantially intact antibody molecules by 
methods that are well-known. See for example, U.S. Patent No. 4,342,566 to 
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Theofilopoious et al Fab' antibody molecule portions are also well-known and are 
produced from F(ab') 2 portions followed by reduction of the disulfide bonds linking the 
two heavy chain portions as with mercaptoethanol, and followed by alkylation of the 
resulting protein mercaptan with a reagent such as iodoacetamide. An antibody 
5 containing intact antibody molecules is preferred herein. 

The phrase "monoclonal antibody" in its various grammatical forms refers to an 
antibody having only one species of antibody combining site capable of immunoreacting 
with a particular antigen. A monoclonal antibody thus typically displays a single binding 
10 affinity for any antigen with which it immunoreacts. A monoclonal antibody may 

therefore contain an antibody molecule having a plurality of antibody combining sites, 
each immunospecific for a different antigen; e.g., a bispecific (chimeric) monoclonal 
antibody. 

15 The phrase "pharmaceutical^ acceptable" refers to molecular entities and compositions 
that are physiologically tolerable and do not typically produce an allergic or similar 
untoward reaction, such as gastric upset, dizziness and the like, when administered to a 
human. 

20 The phrase "therapeutically effective amount" is used herein to mean an amount 
sufficient to prevent, and preferably reduce by at least about 30 percent, more 
preferably by at least 50 percent, most preferably by at least 90 percent, a clinically 
significant change in the S phase activity of a target cellular mass, or other feature of 
pathology such as for example, elevated blood pressure, fever or white cell count as 

25 may attend its presence and activity. 

A DNA sequence is "operatively linked" to an expression control sequence when the 
expression control sequence controls and regulates the transcription and translation of 
that DNA sequence. The term "operatively linked" includes having an appropriate start 

30 signal (e.g., ATG or AUG) in front of the DNA sequence to be expressed and 

maintaining the correct reading frame to permit expression of the DNA sequence under 
the control of the expression control sequence and production of the desired product 
encoded by the DNA sequence. If a gene that one desires to insert into a recombinant 
DNA molecule does not contain an appropriate start signal, such a start signal can be 

35 inserted in front of the gene. 
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The term "standard hybridization conditions" in general refers to salt and temperature 
conditions substantially equivalent to 5 x SSC and 65 °C for both hybridization and 
wash. However, one skilled in the art will appreciate that such "standard hybridization 
conditions" are dependent on particular conditions including the concentration of sodium 

5 and magnesium in the buffer, nucleotide sequence length and concentration, percent 
mismatch, percent formamide, and the like. Also important in the determination of 
"standard hybridization conditions" is whether the two sequences hybridizing are RNA- 
RNA, DNA-DNA or RNA-DNA. Such standard hybridization conditions are easily 
determined by one skilled in the art according to well known formulae, wherein 

0 hybridization is typically 10-20°C below the predicted or determined T m with washes of 
higher stringency, if desired. 

By "HCV is meant a diverse group of related viruses classified as a separate genus in 
the flavivirus family. The characteristics of this genus are described in the Background 
5 of the Invention above, and include such members as HCV-1, HC-J1, HC-J, HCV-BK, 
HCV-H, HC-J6, HC-J8, HC-J483, HC-J491, HC-C2, HCV-JK, HCV-N, HCV-T, HCV-JT, 
HC-G9, HCV-K3a, NZL1, HCV-Tr and the like. 

In its primary aspect, the present invention concerns the identification of novel terminal 
sequences present in HCV positive-sense genomic RNA. 

In a particular embodiment, the present invention relates to novel 5' and 3' terminal 
sequences which are highly conserved by all members of the herein disclosed HCVs. 

As stated above, the present invention also relates to a recombinant DNA or RNA 
molecule or cloned gene, or a degenerate variant thereof, which encodes an HCV, or a 
fragment thereof, which has a nucleotide sequence or is complementary to a nucleotide 
sequence shown in SEQ ID N0S:1-4 or that shown FIGURE 3 (SEQ ID NOS:20-24; the 
parts of sequences downstream of the poly (U) tract only), FIGURE 6 (SEQ ID NOS:28- 
31) and FIGURE 8 (SEQ ID NOS:33-36). 

The invention also relates to infectious HCV cDNA clones comprising previously 
disclosed 5' non-coding, coding and 3' non-coding sequences including those encoding 
poly (U) or poly (A) tracts ((HCV-1 acc.# M62321; HC-J1 acc.# D10749; HC-J acc.# 
D90208; HCV-BK acc.# M58335; HCV-H acc.# M67463; HC-J6 acc.# D00944; HC-J8 
acc.# D01221; HC-J483 acc.# D13558; HC-J491 acc.# D1O750; HC-C2 acc.# 
D10934; HCV-JK acc.# X61596; HCV-N acc.# S62220; HCV-T acc.# M84745; HCV- 
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JT acc.# D01171; HCV-JT acc.# D01172; HC-G9 acc.# D14853; HCV-K3a acc.# 
D28917; NZL1 acc.# D17763; HCV-Tr acc.# D26556 and others), the polypyrimidine 
tract, and the novel 3' element of 101 bases (SEQ ID NO: 1c, d or related sequences). 

The possibilities both diagnostic and therapeutic that are raised by the existence of the 
full length HCV clone, derive from the fact that the terminal sequences of viral genomes 
can be critical for the generation of functional, replication competent RNAs (Boyer et al, 
J. Gen. ViroL 198:415-426). As suggested earlier and elaborated further on herein, the 
present invention contemplates pharmaceutical intervention in the infectious life cycle of 
HCV. 

Thus, in instances where it is desired to inhibit infectivity of HCV, an appropriate 
inhibitor of the 3' sequence element could be introduced to block the interaction of the 
initiation of negative- and positive-strand synthesis required for viral replication. 
Correspondingly, infectivity may be remedied by the introduction of additional quantities 
of a nucleic acid molecule encoding the 3' sequence element or its chemical or 
pharmaceutical cognates, analogs, fragments and the like. 

As discussed earlier, the molecules or agents exhibiting either mimicry or antagonism to 
the 3' sequence element, or control over the replication of the HCV, may be prepared in 
pharmaceutical compositions, with a suitable carrier and at a strength effective for 
administration by various means to a patient experiencing an adverse medical condition 
associated with HCV for the treatment thereof. A variety of administrative techniques 
may be utilized, among them parenteral techniques such as subcutaneous, intravenous 
and intraperitoneal injections, catheterizations and the like. Average quantities of the 
molecules or their subunits may vary and in particular should be based upon the 
recommendations and prescription of a qualified physician or veterinarian. 

As suggested earlier, the diagnostic method of the present invention comprises 
examining a cellular sample or medium by means of an assay including an effective 
amount of a binding partner to the HCV RNA. As previously discussed, patients capable 
of benefiting from this method include those suffering from infection by HCV. Methods 
for isolating the molecules which bind HCV sequences to assist in the examination of 
the target cells are all well-known in the art. 

The present invention further contemplates therapeutic compositions useful in practicing 
the therapeutic methods of this invention. A subject therapeutic composition includes, 



WO 97/08310 



PGT/US96/14033 



26 

in admixture, a pharmaceutical^ acceptable excipient (carrier) and one or more of a 3' 
terminal sequence element, analog thereof or fragment thereof, or a molecule which 
inhibits the properties or activities of that sequence element, as described herein as an 
active ingredient. In a preferred embodiment, the composition comprises a molecule 
5 capable of modulating the secondary structure formation of the 3' terminus of the HCV 
RNA, and/or the initiation of negative- and positive-strand RNA synthesis and/or RNA 
packaging. 

A further therapeutic composition includes a full-length attenuated HCV which can be 
0 used as a vaccine against HCV infection, wherein a sequence within the genome of 
HCV, in particular the 3' terminal sequence element (SEQ ID N0S:1-4) has been 
modified, either as a naturally-occurring isolate, or via in vitro evaluation or site-directed 
mutagenesis. Particular mutations suitable for such attenuated viruses include those 
which alter the structure of the 3' terminus of the HCV genome RNA (or the 5' terminus 
of negative sense HCV RNA), and, by so doing, alter the initiation and/or translation of 
negative- and positive-strand RNA synthesis and/or RNA packaging. Such modifications 
may be aided by computer modelling and evaluated using infectivity assays. 

The preparation of therapeutic compositions which contain the 3' terminal sequence 
element or antagonist -thereof, or the full-length attenuated HCV, analogs or active 
fragments as active ingredients is well understood in the art. Typically, such 
compositions are prepared as injectables, either as liquid solutions or suspensions, 
however, solid forms suitable for solution in, or suspension in, liquid prior to injection 
can also be prepared. The preparation can also be emulsified. The active therapeutic 
ingredient is often mixed with excipients which are pharmaceutical^ acceptable and 
compatible with the active ingredient. Suitable excipients are, for example, water, 
saline, dextrose, glycerol, ethanol, or the like and combinations thereof. In addition, if 
desired, the composition can contain minor amounts of auxiliary substances such as 
wetting or emulsifying agents, pH buffering agents which enhance the effectiveness of 
the active ingredient. 

A polypeptide, analog or active fragment can be formulated into the therapeutic 
composition as neutralized pharmaceutical^ acceptable salt forms. Pharmaceutical^ 
acceptable salts include the acid addition salts (formed with the free amino groups of 
the polypeptide or antibody molecule) and which are formed with inorganic acids such 
as, for example, hydrochloric or phosphoric acids, or such organic acids as acetic, 
oxalic, tartaric, mandelic, and the like. Salts formed from the free carboxyl groups can 
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also be derived from inorganic bases such as. for example, sodium, potassium, 
ammonium, calcium, or ferric hydroxides, and such organic bases as isopropylamine, 
trimethylamine, 2-ethylamino ethanol, histidine, procaine, and the like. 

5 The therapeutic compositions are conventionally administered intravenously, as by 

injection of a unit dose, for example. The term "unit dose" when used in reference to a 
therapeutic composition of the present invention refers to physically discrete units 
suitable as unitary dosage for humans, each unit containing a predetermined quantity of 
active material calculated to produce the desired therapeutic effect in association with 
10 the required diluent; i.e., carrier, or vehicle. 

The compositions are administered in a manner compatible with the dosage formulation, 
and in a therapeutically effective amount. The quantity to be administered depends on 
the subject to be treated, capacity of the subject's immune system to utilize the active 

15 ingredient, and degree of inhibition or neutralization of HCV desired. Precise amounts of 
active ingredient required to be administered depend on the judgment of the practitioner 
and are peculiar to each individual. However, suitable dosages may range from about 
0.1 to 20, preferably about 0.5 to about 10, and more preferably one to several, 
milligrams of active ingredient per kilogram body weight of individual per day and 

20 depend on the route of administration. In the case of attenuated virus used as a 
vaccine, dosages could range from 10 to 10 6 infectious doses. For inactivated viral 
vaccines, higher doses of HCV antigen and a suitable adjuvant could be required. 
Suitable regimes for initial administration and booster shots are also variable, but are " 
typified by an initial administration followed by repeated doses at one or more hour 

25 intervals by a subsequent injection or other administration. Alternatively, continuous 
intravenous infusion sufficient to maintain concentrations of ten nanomolar to ten 
micromolar in the blood are contemplated. 

The therapeutic compositions may further include an effective amount of the 3' 
30 sequence element, its antagonist, or analog thereof, in combination with an antibiotic, a 
steroid, interferon or other anti-HCV therapeutic. Exemplary formulations are given 
below: 
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Formulations 



Intravenous Formulation I 

Ingredient mo/ml 

cefotaxime 250.0 

HCV, fragment or antagonist 10.0 

dextrose USP 45 0 

sodium bisulfite USP 3.2 

edetate disodium USP 0.1 

water for injection q.s.a.d. 1.0 ml 



Intravenous Formulation II 

Ingredient 

ampiciilin 

HCV, fragment or antagonist 
sodium bisulfite USP 
disodium edetate USP 
water for injection q.s.a.d. 



Intravenous Formulation III 

Ingredient mq/ml 

gentamicin (charged as sulfate) 40.0 

HCV, fragment or antagonist 10.0 

sodium bisulfite USP 3.2 

disodium edetate USP 0.1 

water for injection q.s.a.d. 1.0 ml 

Intravenous Formulation IV 

Ingredient mo/ml 

HCV, fragment or antagonist 10.0 

dextrose USP 45 0 

sodium bisulfite USP 3.2 

edetate disodium USP 0.1 

water for injection q.s.a.d. 1 .0 ml 



mq/ml 

250.0 

10.0 

3.2 

0.1 

1.0 ml 
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Intravenous Formulation V 



Ingredient 



mo/ml 



HCV, fragment or antagonist 
sodium bisulfite USP 



3.2 



5.0 



disodium edetate USP 



0.1 



water for injection q.s.a.d. 



1.0 ml 



As used herein, n pg w means picogram, "ng" means nanogram, n ug n or Vg" mean 
microgram, "mg" means milligram, n ur or >T mean microliter, "ml" means milliliter, T 
means liter. 

A wide variety of cell types may be useful as host cells for HCV replication, as initiated 
using the functional HCV cDNA clones created by incorporation of sequences from the 
present invention. These host cells may include primary human cells (e.g., hepatocytes, 
T-cells, B-cells, monocytes/macrophages, foreskin fibroblasts) as well as continuous 
human cell lines [e.g., HepG2, Huh7, HUT78, HPB-Ma, MT-2 (and other HTLV-I and 
HTLV-II infected T-cell lines), Namalowa, Daudi, EBV-transformed LCLs). In addition, 
continuous cell lines which are readily transfected with RNA and permissive for 
replication of flaviviruses or pestiviruses may support HCV replication (e.g. SW-13, 
Vero, BHK-21, COS, PK-15, MBCK). One skilled in the art will be able to select the 
proper host cells without undue experimentation to accomplish the desired infectivity 
assay without departing for the scope of this invention. 

It is further intended that 3' terminal sequence analogs or HCV analogs may be prepared 
from nucleotide sequences derived within the scope of the present invention. Analogs, 
such as fragments or mutants (e.g., "muteins,") can be produced by standard cleavage 
by restriction enzymes, or site-directed mutagenesis of the HCV coding and non-coding 
(5' and 3' terminal) sequences. Analogs exhibiting "HCV inhibiting activity" such as 
small molecules, whether functioning as promoters or inhibitors, may be identified by 
known in vivo and/or in vitro assays. 

As mentioned above, a DNA sequence encoding the 3' sequence element, its 
complement, or the full length wild-type or attenuated HCV can be prepared 
synthetically rather than cloned. The DNA sequence can be designed with the 
appropriate codons for amino acid sequence encoded by the HCV open reading frame. 
The complete sequence is assembled from overlapping oligonucleotides prepared by 
standard methods and assembled into a complete coding sequence. See, e.g.. Edge, 
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Nature, 292:756 (1981); Nambair et al, Science, 223:1299 (1984); Jay et al, J. Biol. 
Chem., 255:6311 (1984). 

The ability of the 3' terminal sequence element to drive replication-competent HCV 
5 RNAs can be analyzed by using constructs in which the ORF of HCV has been replaced 
by a reporter gene, such as luciferase, which can be detected directly and correlated to 
the level of HCV RNA in the cell. In particular, the 3' terminal sequence element can be 
used to derive replication competent HCV RNAs, either fulMength RNAs capable of 
complete replication and virus production or replicons. Such repiicons would be capable 
0 of RNA replication, but might lack the structural region/packaging machinery and hence 
not produce virus. Cells transfected/transformed with such replicons (containing the 3' 
element) would be useful for inhibitors of HCV RNA replication, including those which 
might interfere with the function of the 3' element or its complement. RNA replication 
could be assessed either by looking directly at HCV RNA levels (RT/PCR, B-DNA, 
5 Northern blot analyses) or by incorporating a sensitive reporter (like luciferase) under the 
control of the HCV RNA replication and translational machinery. 

Synthetic DNA sequences allow convenient construction of genes which will express 
HCV, HCV variants or attenuated HCV. Alternatively, DNA encoding variant or 
) attenuated HCV can be made by site-directed mutagenesis of native HCV cDNAs. 

A general method for site-specific incorporation of unnatural amino acids into proteins is 
described in Christopher J. Noren, Spencer J. Anthonv-Cahill, Michael C. Griffith, Peter 
G. Schultz, Science. 244:182-188 (April 1989). This method may be used to create 
HCV virions containing proteins with unnatural amino acids. 

The present invention extends to the preparation of antisense nucleotides and ribozymes 
that may be used to interfere with HCV RNA translation, stability, 

replication/transcription and/or packaging. This approach utilizes antisense nucleic acid 
and ribozymes to block viral replication, either by masking the HCV RNA with an 
antisense nucleic acid or cleaving it with a ribozyme. 

Antisense nucleic acids are DNA or RNA molecules that are complementary to at least a 
portion of a specific RNA molecule. (See Weintraub, 1990; Marcus-Sekura, 1988.) In 
the cell, they hybridize to that RNA, forming a double stranded molecule. Therefore, 
antisense nucleic acids may interfere with viral replication (either by a direct blocking 
effect or by leading to degradation of the target RNA by cellular enzymes). Oligomers 



WO 97/08310 



PCT/US96/14033 



31 

of about fifteen nucleotides are appropriate, since they are easy to synthesize and are 
likely to pose fewer problems than larger molecules when introducing them into HCV- 
infectible cells. However, also suitable are oligonucleotides of natural structure or those 
with modifications to enhance stability, facilitate, uptake, etc. In addition, longer 
5 antisense RNAs can be generated in vivo in hepatocytes or other HCV target cells using 
gene therapy approaches. Antisense methods have been used to inhibit the expression 
of many genes in vitro (Marcus-Sekura, 1988; Hambor et al, 1988). 

Ribozymes are RNA molecules possessing the ability to specifically cleave other single 
0 stranded RNA molecules in a manner somewhat analogous to DNA restriction 

endonucleases. Ribozymes were discovered from the observation that certain mRNAs 
have the ability to excise their own introns. By modifying the nucleotide sequence of 
these RNAs, researchers have been able to engineer molecules that recognize specific 
nucleotide sequences in an RNA molecule and cleave it (Cech, 1988.). Because they 
5 are sequence-specific, only RNAs (such as the HCV positive-sense genome RNA or its 
complement) with particular sequences are inactivated. 

Investigators have identified two types of ribozymes, Tetrahymena-type and 
"hammerhead "-type. (Hasselhoff and Gerlach, 1988) Tetrahymena-type ribozymes 
0 recognize four-base sequences, while "hammerhead"-type recognize eleven- to 

eighteen-base sequences. The longer the recognition sequence, the more likely it is to 
occur exclusively in the target mRNA species. Therefore, hammerhead-type ribozymes 
are preferable to Tetrahymena-type ribozymes for inactivating a specific mRNA species, 
and eighteen base recognition sequences are preferable to shorter recognition 
sequences. 

The DNA sequences described herein may thus be used to prepare antisense molecules 
against, and ribozymes that cleave HCV RNA. It should be appreciated that such 
antisense molecules and ribozymes will encompass nucleotide sequences of both 
positive and negative strand polarity, such that they may bind to both the positive and 
negative strand HCV RNAs. 

The present invention also relates to a variety of diagnostic applications, including 
methods for detecting HCV. The invention also relates to methods of correlating 3' 
NTR sequences with various clinical parameters such as disease severity, response to 
treatment with interferon, and immune status, or to determine tissue tropism (predictive 
diagnostics.) 
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The present HCV sequences can be used experimentally to identify HCV isolates 
containing additional 3' sequences, to determine 3' NTR sequences for various HCV 
genotypes to define further areas of conservation and divergence. Also, the chemical 
modification and analysis of the RNA by RNAse mapping and three-dimensional 
structure analysis of the RNA is an aid to identifying host or viral factors which interact 
with the sequence and/or identifying molecules which inhibit replication. 

HCV RNA can also be used therapeutically, i.e., attenuated HCV can be used for 
vaccine development, and the 3' NTR sequence element may be used as a trans- 
dominant inhibitor of HCV replication via gene therapy. 

Replication of HCV in cells can be ascertained by branched DNA (B-DNA), quantitative 
RT/PCR and immunological procedures, or using standard methods for determining virus 
titer (i.e., titration in the chimpanzee). The procedures and their application are all 
familiar to those skilled in the art and accordingly may be utilized within the scope of 
the present invention. A "competitive" antibody binding procedure is described in U.S. 
Patent Nos. 3,654,090 and 3,850,752. A "sandwich" procedure, is described in U.S. 
Patent Nos. RE 31,006 and 4,016,043. Still other procedures are known such as the 
"double antibody", or "DASP" procedure. 

In each instance, HCV proteins form complexes with one or more antibody(ies) or 
binding partners and one member of the complex is labeled with a detectable label. 
Alternatively, an antibody may be raised, or identified in HCV-infected patients, which 
binds to the present HCV 3' terminal sequence element. The fact that a complex has 
formed and, if desired, the amount thereof, can be determined by known methods 
applicable to the detection of labels. 

Also within the scope of the invention are RNA molecules which mimic the 3' terminal 
sequence element or its complement or bind to these elements, selected in vitro using 

the "SELEX" (Tuerk and Gold, 19 ) or other in vitro selection/evolution approaches. 

These methods provide libraries of RNAs with randomized sequences which can be 
selected by reiterative binding to a target (in this case, the 3' terminal sequence 
element, its complement, or the cognate binding partners required for the functions of 
these respective elements), and RNAs bound thereto are amplified by PCR. Such 
molecules may either mimic the structure of the 3' terminal element or be competitive 
inhibitors of the 3' terminal sequence element. Such SELEX RNAs may be suitable for 
diagnostic and even therapeutic uses within the scope of the present invention. 
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Alternatively, the presence of HCV RNA can be determined by Northern analysis, 
polymerase chain reaction (PCR), primer extension, and the like. 

The labels most commonly employed for these studies are radioactive elements, 
enzymes, chemicals which fluoresce when exposed to ultraviolet light, and others. 

A number of fluorescent materials are known and can be utilized as labels. These 
include, for example, fluorescein, rhodamine, auramine, Texas Red, AMCA blue and 
Lucifer Yellow. A particular detecting material is anti-rabbit antibody prepared in goats 
and conjugated with fluorescein through an isothiocyanate. 

An antibody to HCV proteins, or a probe for HCV RNA or their binding partner(s) can 
also be labeled with a radioactive element or with an enzyme. The radioactive label ca 
be detected by any of the currently available counting procedures. The preferred 
isotope may be selected from 3 H, 14 C, 32 P, 35 S, 36 C1, 51 Cr, 57 Co, 58 Co, 59 Fe, 90 Y, 125 l, 131 l 
and 186 Re. 



Enzyme labels are likewise useful, and can be detected by any of the presently utilized 
colorimetric, spectrophotometric, fluorospectrophotometric, amperometric or gasometric 
techniques. The enzyme is conjugated to the selected probe by reaction with bridging 
molecules such as carbodiimides, diisocyanates, glutaraldehyde and the like. Many 
enzymes which can be used in these procedures are known and can be utilized. Those 
preferred are peroxidase, S-glucuronidase, S-D-glucosidase, S-D-galactosidase, urease, 
glucose oxidase plus peroxidase and alkaline phosphatase. U.S. Patent Nos. 
3,654,090; 3,850,752; and 4,016,043 are referred to by way of example for their 
disclosure of alternate labeling material and methods. In addition, a probe may be 
biotin-labelled, and thereafter be detected with labelled avidin, or a combination of 
avidin and a labelled anti-avidin antibody. Probes may also have digoxygenin 
incorporated therein and be then detected with a labelled anti-digoxygenin and detected 
with a labelled anti-digoxygenin antibody. 

In a further embodiment of this invention, commercial test kits suitable for use by a 
medical specialist may be prepared to determine the presence or absence of infectious 
HCV in suspected patient samples. Such test kits may employ techniques such as 
RT/PCR, branched DNA and ligation chain reaction (LCR). In accordance with the 
testing techniques discussed above, one class of such kits will contain at least a labeled 
HCV antibody or oligonucleotide probe or its binding partner, and directions, of course, 
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depending upon the method selected. The kits may also contain peripheral reagents 
such as buffers, stabilizers, etc. 

Accordingly, a test kit may be prepared for the demonstration of the presence of HCV, 
comprising: 

(a) a predetermined amount of at least one labeled oligonucleotide probe directed to 
a 3' sequence element of the HCV genome, obtained by the direct or indirect 
attachment of the oligonucleotide or a specific binding partner thereto, to a detectable 
label; 

(b) other reagents; and 

(c) directions for use of said kit. 

More specifically, the diagnostic test kit may comprise: 

(a) a known amount of the labeled oligonucleotide probe as described above (or a 
binding partner) generally bound to a solid phase to form an immunosorbent, or in the 
alternative, bound to a suitable tag, or plural such end products, etc. (or their binding 
partners) one of each; 

(b) if necessary, other reagents; and 

(c) directions for use of said test kit. 

In accordance with the above, an assay system for screening potential drugs effective 
to modulate the replicative activity of the HCV RNA may be prepared. The infectious 
HCV RNA may be introduced into a test system, and the prospective drug may also be 
introduced into the resulting cell culture, and the culture thereafter examined to observe 
any changes in the replicative activity of the cells, due to the addition of the prospective 
drug. 

PRELIMINARY CONSIDERATIONS 

While a great deal of progress has been made in the last several years, there are still a 
vast number of unanswered questions concerning HCV replication, pathogenesis and 
immunity. The field is rapidly reaching a bottleneck some aspects of the functions of 
the HCV genome RNA and its encoded proteins are understood, but prior to the present 
invention, no way existed of experimentally testing structure/function questions in the 
context of authentic virus replication. Such analyses are critical for understanding each 
step in the virus life cycle at a level which will allows the design of protective vaccines 
and effective therapy for chronically infected patients. 
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The present invention stemmed from attempts to construct functional cONA clones for 
HCV, in particular the HCV-H strain. Prior to the present invention, attempts to recover 
infectious HCV RNA from cDNA have been unsuccessful. Several possible explanations, 
alone or in combination, account for previous failures, including missing or incorrect 
terminal sequences, internal errors deleterious or lethal for HCV replication, or 
inadequate methods for assaying infectivity and replication. 

Rationale for rigorously determining the HCV-H termini. 5' and 3' terminal sequences of 
HCV-H were previously unknown. Previous attempts to generate functional transcripts 
used terminal sequences determined for other HCV isolates. As mentioned above, work 
in other RNA virus systems has shown that specific terminal sequences can be critical 
for the generation of functional, replication competent RNAs (Boyer and Haenni, (1994) 
J. Gen. Virol. 198:415-426). Such sequences are believed to be involved in initiation of 
negative- and positive-strand RNA synthesis. Given the importance of these sequence 
elements for other viruses, the present invention more rigorously determined the HCV-H 
terminal sequences. 

Structure of the 5' NTR. Methods used to amplify and clone the extreme 5' termini of 
RNAs include homopolymer tailing or ligation of synthetic oligonucleotides to first-strand 
cDNA (5' RACE), cyclization of first-strand cDNA followed by inverse PCR (Zeiner and 
Gehrrng (1994) BioTechniques 17:1051-1053), or cyclization of genome RNA with RNA 
ligase (after treatment to remove 5' cap structures, if necessary) followed by cONA 
synthesis and PCR amplification across the 5'-3' junction (Mandl et al (1991) 
Biotechniques 10:486). 5' terminal sequences were determined for a number of HCV 
isolates and are in general agreement. For HCV-H, both the cyciization/inverse PCR and 
5' RACE methods were used to determine a 5'-terminal consensus sequence: 

5'-GCCAGCCCCCTGATGGGGGCGACACTCCACCATGAAATC-3' (SEQ ID NO:37) 

This sequence is highly homologous to those determined for other isolates. At lower 
frequency, clones with additional 5' residues (usually 1 additional G) were also 
recovered. Although this might reflect additional sequences or heterogeneity at the 
HCV 5' terminus, these clones may be artifactual and created by partial copying of a 5' 
cap structure or addition of non-templated 3' bases by reverse transcriptase during 
first-strand cDNA synthesis. It cannot be excluded that the 5' terminus of HCV genome 
RNA contains a 5' cap structure or a covalently-linked terminal protein such as VPg of 
the picornaviruses. For the pestiviruses, recent studies suggest that genome RNAs may 
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not contain a 5' cap (Brock et a! (1992) J. Virol. Meth. 38:39-46) and that this 
structure is not required for infectivity of transcribed RNA (R. Donis, R. J. Moorman, 
personal communications). Consistent with these observations, neither HCV nor the 
pestiviruses contain motifs characteristic of virus-encoded enzymes involved in capping 
or methylation of cap structures (Rice (1995)). There is the possibility of a protein 
covalently-linked to the 5' terminus of HCV genome RNA, but synthetic RNA transcripts 
corresponding to viruses that normally contain such 5' structures are infectious, 
indicating that they are usually not an absolute requirement for initiation of replication. 

EXAMPLE 1 



Structure of the HCV-H 3' NTR. Determination of the extreme 3' terminal HCV 
sequences presented a greater challenge. Due to limited quantities of HCV genome 
RNA, the classic method of 3'-end labeling and direct RNA sequence analysis has not 
been feasible. One of the first reports suggested that HCV RNAs contained 3' terminal 
poly (A) tracts (for HCV-1) Han et al (1991) used tagged oligo (dT) primers for cDNA 
synthesis followed by PCR amplification, cloning and sequence analysis. This is not an 
acceptable method for 3' end determination as it already presupposes a 3' poly (A) or 
polypurine tract and would select for such RNAs even if they were present in low 
abundance. Some reports utilized £. co/i poly (A) polymerase to add 3' homopolymer 
tracts prior to oligo (dT)-primed cDNA synthesis and found evidence of a 3\ terminal 
poly (U) (i.e., Kato et al, 1990) or isolated 3' clones with poly (U) tracts from randomly 
primed cDNA libraries. To actually determine the 3' terminal sequence and to solve this 
contradiction [3' poly (A) versus poly (U)], 5' RACE was used to determine the 5' 
terminal sequence of HCV negative-strand RNA (Chen et al, 1992). This study 
predicted a 3' terminal poly (U) tract for the HCV genome RNA. Subsequently, other 
groups have not attempted to determine actual 3' termini, but rather assumed a 3' poly 
(U) tract and used oligo (dA) for priming cDNA synthesis or isolated 3' clones with poly 
(U) tracts from randomly primed cDNA libraries. Depending upon the actual 3' 
structure, all of these approaches have potential problems (some are discussed below) 
and a critical reading of the literature makes it clear that the 3' end of the genome RNA 
has until now been poorly characterized and uncertain at best. Alternative approaches 
for determining the terminal sequence of HCV-H were therefore pursued. 



One such approach, mentioned above for 5' end determination, is to cyclize the RNA 
using t4 RNA ligase followed by cDNA synthesis and amplification using a 
negative-sense primer complementary to a sequence in the 5' NTR and a positive-sense 
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primer near the 3' end (Mandl et al, 1991). Ideally, cloning and sequence analysis of 
such products should provide information on 5' and 3' terminal sequences present in the 
same RNA molecule. In the case of HCV, since the 5' terminus has been reasonably 
well-defined by other methods, these data should allow determination of the 3' 
terminus. Unfortunately, despite repeated attempts, this method has failed for 
determination of HCV-H terminal sequences (even using RNA obtained from high 
specif ic-infectivity plasma). Potential problems include (i) a blocked 5' terminus (ii) lack 
of a 3' -OH group or a poor acceptor for RNA ligase (such as a U residue; Moore and 
Sharp, 1992) (iii) ribonuclease activity during RNA preparation or RNA ligation (see 
below) or (iv) terminal RNA structures sterically inhibiting 5'-3' ligation. In any case, 
recent work has shown this method to be unreliable, even when using large quantities 
of purified, initially intact TBE genome RNA. Likely problems include RNase 
contamination in most commercial preparations of tobacco acid pyrophosphatase and 
T4 RNA ligase (A. A. Koiykhalov, unpublished data) and probable hypersensitivity of the 
poly (U)/polypyrimidine tract to the action of RNases. In any case, poly (A) was 
incorrectly assigned as the terminal sequence of TBE genome RNA, whereas later 
experiments demonstrated that such poly (A) tracts were internal and followed by 
additional sequences including a 3' terminal hairpin structure (C. Mandl, personal 
communication). Interestingly, the correct 3' structure, but not the poly (A) tract, is 
required for infectivity of transcribed TBE RNA {C. Mandl, personal communication). 

Two other methods were considered. For determining the YF 3' terminus, £. coli poly 
(A) polymerase had been used to add 3' terminal poly (A), followed by oligo (dT) priming 
and a selective cloning strategy. The YF 3' terminus was cloned with great difficulty 
and found to be a highly stable hairpin structure (Hahn et al, 1987; Rice et al, 1985). 
However, this approach was not considered since addition of 3' poly (A) would allow 
self-priming at the HCV poly (U) tract and subsequent elimination of potential sequences 
in between during second-strand cDNA synthesis and cloning. Rather, an alternative 3' 
RACE method was used in which a synthetic oligodeoxynucleotide, present at high 
concentrations, was ligated to the 3' end of the RNA to serve as a specific priming site 
for cDNA synthesis (Figure 1). Ligation conditions were optimized by assaying the 
ability of T4 RNA ligase to ligate 5 '-end-labeled oligonucleotides to a synthetic acceptor 
RNA (Brennan et al (1983) Meth. Enz. 100:38-52). Critical parameters included the 
batch of RNA ligase (many were heavily contaminated with RNase), the concentration 
of DMSO (20-30%), and the particular oligonucleotide used for ligation (A. A. 
Koiykhalov, unpublished). For the 3' analysis of HCV-H, 10 4 molecules of RNA were 
purified from high-titered H77 plasma, ligated to the synthetic oligonucleotide, and this 
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modified RNA used for RT/PCR (Figure 1). One primer for cDNA synthesis and PCR 
amplification (oiigo B - SEQ ID NOS:6 and 7) was complementary to that used for 
ligation to the RNA (oligo A - SEQ ID NO:5); a second positive-sense primer 
corresponded to a sequence near the 3' end of the HCV ORF (oligo C - SEQ ID NO:8). 

A smear of amplified products, as resolved by agarose gel electrophoresis, was obtained 
after 40 cycles of PCR amplification. This DNA was either subjected to additional PCR 
analyses or cloned directly for sequence determinations. The presence of predicted 
internal HCV sequences and homopolymer tracts was assayed using a nested positive 
sense primer and either oligo (dA) or oligo (dT). A product of the expected size (based 
on previous HCV 3' NTR sequences) was obtained using the oligo (dA) primer; no 
product was found using oligo (dT). Prototype HCV cDNA clones terminating in either 
poly (A) or poly (T) served as positive and negative controls for these primer pairs and 
gave the expected results. These data strongly suggest that HCV-H does not contain 
poly (A) but rather, as found for most HCV isolates, contains a poly (U) tract (or at least 
a site for priming by oligo (dA)] at or near its 3' terminus. From the cloned material 
[which had not be subjected to further amplification using oligo (dA)], sequences from 
20 independent clones were determined. Essentially all of these clones contained (5' to 
3', positive-sense) (i) the previously determined HCV-H sequence (Inchauspe et al 
(1991) Proc. Nat/. Acad. Sci. USA 88:10292-10296) (ii) 40 bases homologous to other 
HCV isolates (Figure 2) fiii) poly (U) tracts of various lengths and (iv) the sequence of 
the oligonucleotide used for RNA ligation. Five independent clones, derived from two 
different PCR amplification experiments, were found to have unusual structures. 
Following variable lengths of poly (U) and polypyrimidine stretches consisting of mainly 
U with occasional interspersed C residues, four of these clones contained a novel 
sequence of 101 bases (SEQ ID NO:1) which was nearly identical in all clones (two 
clones differed by 1 substitution each; 1 clone terminated after only 39 bases of this 
sequence) (Figure 3). This 101 base sequence, particularly the 3' terminal 46 bases, is 
predicted (FOLDRNA, GCG package) to form a highly stable secondary structure 
reminiscent of the 3' termini of members of the flavi virus genus (Chambers et al (1990) 
Virology 177:159-174) (Figure 4). However, an exhaustive search (BLAST, FASTA) of 
the databases has revealed no entry showing significant homology to this novel HCV 
sequence. 

Several lines of evidence suggest that this 101 base sequence is not an RT/PCR artifact 
and represents the 3' terminal sequence of HCV genome RNA. 
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First, as mentioned above, HCV-H clones with similar but not identical structures were 
obtained from two independent experiments (Figure 3). These clones differed in the 
length of the poly (Ul/polypyrimidine tract and by a few base substitutions within the 
101 base element, but the sequences across the breakpoint between the novel 
sequence and the ligated oligonucleotide were identical in four clones. 

In a second set of experiments, negative-sense oligonucleotides were designed based on 
the sequence of the 101 -base element, and used for RT/PCR amplification and cloning 
of HCV RNA from either HCV-H or four different clinical samples obtained from 
investigators around the country. Samples were obtained from patients with chronic 
hepatitis C and all were HCV RNA positive and of different genotypes (1b, 3 and two 
samples of genotype 4) than previously analyzed HCV-H(la). Multiple independent 
clones were obtained and sequenced (Figure 6) for each amplified sample. As described 
above, for HCV-H, all clones contained identical sequences at the end of the ORF and 
the 3' NTR sequence preceding the poly (U)/polypyrimidine tract. This poly 
(U)/polypyrimidine tract was variable in length and followed by the novel 3' element. 
Clones from the other isolates had similar structures except that genotype-specific 
differences were observed in the ORF and 3' NTR sequence preceding the poly 
(U)/polypyrimidine tract. The sequence of the novel 3' element was present and 
absolutely identical in all of these clones suggesting that this element is both present in 
the genome RNAs of distinct genotypes and highly conserved. This experiment also 
proves that this structure is not an in vitro artifact generated by T4 RNA ligase. * 

This analysis demonstrated that the novel 3' element was present in other HCV 
genotypes, but did not define the actual 3' end of these genome RNAs. Using a similar 
RNA ligation procedure (Figure 7), the 3' terminal sequences of these different HCV 
genotypes were determined. The same novel 3' terminal sequence (with one or two 
isolate-specific substitutions) was found joined, at exactly the same breakpoint, to the 
sequence of the oligonucleotide used for T4 RNA ligation (Figure 8). 

A fourth experiment provides yet more evidence that the 3' novel sequence represents 
the 3' terminus of HCV genome RNA. It could be argued that clones with the novel 
structure could be obtained by internal priming within the 3' NTR if, by chance, the 3' 
portion of the synthetic primer used for cDNA synthesis (and PCR amplification) was 
complementary to a sequence within the HCV 3' NTR. To address this concern, the 
analysis was repeated using serum from a different patient (WD) and a distinct 
oligonucleotide Coiigo G"; SEQ ID NO:37) in the RNA ligation step, whose sequence 
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was not homologous to the oligonucleotide used in the initial experiments. The 
complement of this oligonucleotide ("oligo H"; SEQ ID NO:38)was used for cDNA 
synthesis and PCR amplification together with "oligo F w (SEQ ID NO:32), and the 
products were cloned and sequenced. The same novel 3' terminal sequence was found 
5 joined, at exactly the same breakpoint, to the sequence of the alternative synthetic 
oligonucleotide. 

This novel 3' NTR structure appears to be highly conserved among HCV isolates and is 
likely to be an essential RNA element required for virus replication and successful 
0 recovery of infectious HCV RNA from cDNA. 

Based on these data, the current picture of HCV-H genome structure is diagrammed in 
Figure 10. The genome RNA probably initiates with a G residue and contains a 5' NTR 
of 341 bases. The ORF consists of 9033 bases encoding a polyprotein of 301 1 amino 
5 acid residues. Following the opal (UGA) stop codon is a sequence of 40 bases, a poly 
(U) tract, a polypyrimidine stretch, and a highly conserved RNA element of about 100 
bases. Some positive-strand RNA viruses (poliovirus, Sindbis virus) contain 3' terminal 
poly (A) but many others terminate with conserved RNA sequences which can often be 
folded into stable secondary structures (bromoviruses, flaviviruses). Besides TBE 
isolates containing internal poly (A) followed by a 3' terminal secondary structure, there 
is one other example of a virus which contains a 3' NTR similar to that proposed here 
for HCV. This virus, called GBV-B, is one of the newly cloned and sequenced GB 
hepatitis agents (Simons et al (1995) Proc. Natl. Acad. Sci. USA 92:3401-5). These 
two isolates (GBV-A and GBV-B) together with HGV (an agent cloned by Genelabs) 
appear to be most closely related to HCV. All appear to have positive-strand genome 
RNAs of -9-10 kb and a single long ORF encoding proteins with significant homology 
to those of HCV. In the case of GBV-B, the 3' NTR consists of 27 bases, a poly (U) 
tract, and an additional 

sequence of 49 bases (Simons et al, 1995). Other than the poly (U) tract, this 
sequence shows no significant homology with the HCV-H 3' NTR (A. A. Kolykhalov, 
unpublished). 

Discussion 

The present invention provides sequence data which demonstrate that the genome RNA 
of HCV-H (a type 1a isolate) appears not to terminate with a homopolymer tract as 
previously thought, but rather with a novel sequence of 101 bases. Furthermore, 
results suggest that this 3' NTR structure and the putative 3' terminal element may be 
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features common to other HCV genotypes. In addition to the potential importance of 
the 3' NTR for HCV replication and recovery of authentic HCV from cDNA, the apparent 
conservation of the conserved 3' element has important applications for HCV 
diagnostics and therapy. Determination of HCV RNA levels in patient plasma and 
tissues is important not only for diagnosis of HCV infection in the absence of antibody 
response but also for following the efficacy of therapeutic regimens (Bresters et al 
(1994) J. Med. Vfrot. 43:262-8; Cha et al (1991) J. Clin, Microbiol. 29:2528-34; 
Chazouilleres et al (1994) Gastroenterology 106:994-9; Davis et al (1994) Hepatology 
19:1337-41; Feray et al (1994) Hepatology 20:1137-43; Gordon et al (1994) Am. J. 
Gastroenterol. 89:1458-61; Simmonds et al (1994) J. Gen. Virol. 75:1053-1061; 
Wright et al (1994) Hepatology 20:773-9). Current methods, such as quantitative 
RT/PCR or branched DNA, rely on conserved RNA targets in the HCV genome which can 
be either genus-, type-, or subtype-specific. Detection of this novel, conserved 
sequence may be a useful alternative for diagnosis of HCV infection. In terms of 
therapy, highly conserved elements in RNA virus genomes have, in most cases, been 
shown to be essential for efficient virus replication. Such elements, via interaction with 
viral and/or host factors, function in translation of the incoming viral RNA, as promoters 
for negative- and positive-strand RNA synthesis, and as signals for selective packaging 
of viral RNAs. A conserved 3' element in the HCV genome, which is likely to be 
important for one or more of these processes, presents an attractive therapeutic target. 
Identification of compounds which block interaction of this element with its cognate 
host or viral factors or gene therapy approaches using this element as an RNA decoy 
(Sullenger et al (1990) Celt 63:601-608) in transplanted hepatocytes may prove useful 
in eradicating or controlling chronic HCV infections. 



The present invention demonstrates that there may be an association of the novel 
sequence element and HCV infection. The novel 3' sequence element may be used for 
(i) constructing full-length HCV-H cDNA clones capable of yielding infectious RNA and 
virus (vaccine development, evaluation of therapeutic compounds); (ii) engineering 
functional HCV RNA replicons for HCV replication studies and therapeutic evaluation; 
(iii) the determination of 3' NTR sequences for other HCV genotypes and a phylogenetic 
analysis to define areas of conservation and divergence (nucleic acid based diagnostics 
for HCV detection); (iv) the improvement of methods for determining the HCV 3' NTR 
sequences and an examination of possible correlations between HCV 3' NTR features 
and clinical parameters (disease severity, IFN response, immune status) or tissue 
tropism (predictive diagnostics); (v) determination of the 3' NTR secondary structure 
using chemical modification and RNase mapping and determination of 3D structure by 
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NMR; (vi) structure/function studies on the 3' NTR using the infectious clone (therapy, 
vaccine development); (vii) definition of host or viral factors which interact with the 
sequence (therapy); (viii) setting up screening assays to identify compounds which 
inhibit the interaction of the element with its cognate host and/or viral factors and to 
5 test identified compounds for their effects on HCV replication(therapy); (ix) testing the 
conserved 3' NTR sequence element as a trans-dominant inhibitor of HCV replication 
(gene therapy). 
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WHAT IS CLAIMED IS: 

11. A nucleic acid molecule or degenerate variant thereof, which encodes a 3' 

2 terminal sequence element, or a fragment thereof, selected from the group consisting 

3 of: 

4 (A) the nucleotide sequences of SEQ ID N0S:1-4; 

5 (B) the nucleotide sequences of FIGURE 3 (SEQ ID NOS:20-24); 

6 (C) the nucleotide sequences of FIGURE 6 (SEQ ID NOS:28-31); 

7 (D) the nucleotide sequence of FIGURE 8 (SEQ ID NO:33-36); 

8 (E) nucleotide sequences that hybridize to any of the foregoing sequences 

9 under standard hybridization conditions; and 

10 (F) nucleotide sequences having a secondary structure substantially similar 

11 to the structure of Figure 4. 

1 2. The nucleic acid molecule of Claim 1, wherein the 3' terminal sequence element 

2 is present in HCV. 

13. A recombinant DNA or RNA molecule comprising a nucleotide sequence or 

2 degenerate variant thereof, which encodes a 3' terminal sequence element, or a 

3 fragment thereof, selected from the group consisting of: 

4 (A) the nucleotide sequences of SEQ ID N0S:1-4; 

5 (B) the nucleotide sequences of FIGURE 3 (SEQ ID NOS:20-24); 

6 (C) the nucleotide sequences of FIGURE 6 (SEQ ID NOS:28-31); 

7 (D) the nucleotide sequences of FIGURE 8 (SEQ ID NOS:33-36); 

8 (E) nucleotide sequences that hybridize to any of the foregoing sequences 

9 under standard hybridization conditions; and 

1 0 (R nucleotide sequences having a secondary structure substantially 

1 1 homologous to the structure of Figure 4. 

1 4. The nucleic acid molecule or recombinant DNA or RNA of Claims 1 , 2 or 3, 

2 wherein said 3' terminal sequence element is operatively linked to an expression control 

3 sequence. 

15. A probe capable of screening for the 3' terminal sequence element of alternate 
2 strains of HCV prepared from the nucleotide sequence of Claim 2. 

1 6. The probe of Claim 5 labeled with a detectable marker. 
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7. A host cell transformed with a recombinant DNA or RNA molecule comprising a 
DNA or RNA sequence or degenerate variant thereof, which encodes a 3' terminal 
sequence element, or a fragment thereof, selected from the group consisting of: 

(A) the nucleotide sequences of SEQ ID NOS:1-4; 

(B) the nucleotide sequences of FIGURE 3 (SEQ ID NOS:20-24); 

(C) the nucleotide sequences of FIGURE 6 (SEQ ID NOS:28-31); 

(D) the nucleotide sequences of FIGURE 8 (SEQ ID NOS:33-36); 

(E) nucleotide sequences that hybridize to any of the foregoing sequences 
under standard hybridization conditions; and 

(F) nucleotide sequences having a secondary structure substantially similar 
to the structure of Figure 4. 

8. The host cell of Claim 7 wherein the host is a human cell in tissue culture. 

9. The host cell of Claim 8, wherein the host cell is a liver cell. 

10. A method for detecting the presence or activity of the 3' terminal sequence 
element of Claim 1, comprising: 

A. contacting a biological sample from a mammal in which the 
presence or activity of a virus containing the 3' terminal sequence element is suspected 
with a binding partner of said 3' terminal sequence element under conditions that allow 
binding of said 3' terminal sequence element to said binding partner to occur; and - 

B. detecting whether binding has occurred between said 3' terminal 
sequence element from said sample and the binding partner; 

wherein the detection of binding indicates that presence or activity of said virus 
in said sample. 

1 1 . The method of Claim 1 0, wherein the virus is HCV. 

12. A method for detecting the presence and activity of a virus in mammals 
comprising detecting the presence or activity of a 3' terminal sequence element 
according to the method of Claim 10, wherein detection of the presence or activity of 
the 3' terminal sequence element indicates the presence and activity of in said 
mammals. 
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1 13. A method for detecting binding sites for the 3' terminal sequence element of 

2 Claim 1, comprising: 

3 A. placing a labeled binding partner which specifically binds to said 

4 3' terminal sequence element in contact with a biological sample from a mammal in 

5 which binding sites for said 3' terminal sequence element are suspected; and 

6 B. examining said biological sample in binding studies for the 

7 presence of said labeled binding partner; 

8 wherein the presence of said labeled binding partner indicates a binding site for 

9 said 3' terminal sequence element. 

1 14. A method of testing the ability of a drug or other entity to modulate the activity 

2 of the 3' terminal sequence element of Claim 1 which comprises: 

3 A. culturing test cells which are infected with a virus or transfected with a 

4 nucleic acid containing said 3' terminal sequence element; 

5 B. adding the drug under test; and 

6 C. measuring the replication of said virus or nucleic acid in said test cells. 

1 15. The method of Claim 14, wherein the virus is HCV or the nucleic acid is an HCV 

2 RNA replicon. 

1 16. An assay system for screening drugs and other agents for ability to modulate 

2 the replication of a virus containing the 3' terminal sequence element of Claim 1 , 

3 comprising: 

4 A. culturing observable test cells inoculated with a drug or agent directed to 

5 the 3' terminal sequence element; 

6 B. harvesting a supernatant from said test cells; and 

7 C. examining said supernatant for the presence of said virus wherein an 

8 increase or a decrease in a level of said virus indicates the ability of a drug to modulate 

9 the replication of said virus. 

1 17. The assay system of Claim 16, wherein the virus is HCV. 

1 18. An assay system for screening drugs and other agents for ability to modulate 

2 the replication of a virus containing the 3' terminal sequence element of Claim 1 , 

3 comprising: 

4 A. culturing observable test cells inoculated with a drug or agent directed to 

5 the 3' terminal sequence element; 
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6 B. contacting said test cells with a labelled binding partner of or a primer 

7 which will amplify the genome of said virus; and 

8 C. determining the level of viral genome in the test cells wherein an 

9 increase or a decrease in a level of said genome indicates the ability of a drug to 
10 modulate the replication of said virus. 

1 19. The assay system of Claim 18, wherein the virus is HCV. 

1 20. A test kit for the demonstration of a virus in a eukaryotic cellular sample, 

2 comprising: 

3 A. a predetermined amount of a detectably labelled specific binding 

4 partner of the 3' terminal sequence element of Claim 1; 

5 B. other reagents; and 

6 C. directions for use of said kit. 

1 21 . The test kit of Claim 20, wherein the virus is HCV. 

1 22. A method of preventing and/or treating cellular debilitations, derangements 

2 and/or dysfunctions and/or other disease states in mammals caused by a virus 

3 containing the 3' terminal sequence element of Claim 1, comprising administering to a 

4 mammal a therapeutically effective amount of a material selected from the group 

5 consisting of the 3' terminal sequence element, its complement or a fragment thereof, 

6 an agent capable of promoting the production and/or activity of said 3' terminal 

7 sequence element, an agent capable of mimicking the activity of said 3' terminal 

8 sequence element, an agent capable of inhibiting the activity of said 3' terminal 

9 sequence element, and mixtures thereof, or a specific binding partner thereto. 

1 23. The method of Claim 22, wherein the virus is HCV. 

1 24. The method of Claim 22 wherein said disease states include hepatitis, cirrhosis, 

2 and hepatocellular carcinoma. 

1 25. The method of Claim 22 wherein said 3' terminal sequence element, its 

2 complement or a fragment thereof, an agent capable of promoting the production and/or 

3 activity of said 3' terminal sequence element, an agent capable of mimicking the activity 

4 of said 3' terminal sequence element, an agent capable of inhibiting the activity of said 

5 3' terminal sequence element, and mixtures thereof, or a specific binding partner thereto 
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is administered to modulate the course of therapy wherein a primary therapeutic agent 
is also being administered. 

26. The method of Claim 22 wherein said 3' terminal sequence element, its 
complement or a fragment thereof, an agent capable of promoting the production and/or 
activity of said 3' terminal sequence element, an agent capable of mimicking the activity 
of said 3' terminal sequence element, an agent capable of inhibiting the activity of said 
3' terminal sequence element, and mixtures thereof, or a specific binding partner thereto 
is administered to modulate the course of therapy where interferon is being administered 
as the primary therapeutic agent. 

27. The method of Claim 22 wherein said 3' terminal sequence element, its 
complement or a fragment thereof, an agent capable of promoting the production and/or 
activity of said 3' terminal sequence element, an agent capable of mimicking the activity 
of said 3' terminal sequence element, an agent capable of inhibiting the activity of said 
3' terminal sequence element, and mixtures thereof, or a specific binding partner thereto 
is administered to modulate the course of therapy where two or more additional 
therapeutic agents are being co- administered. 

28. The method of Claim 22 wherein said 3' terminal sequence element, its 
complement or a fragment thereof, an agent capable of promoting the production and/or 
activity of said 3' terminal sequence element, an agent capable of mimicking the activity 
of said 3' terminal sequence element, an agent capable of inhibiting the activity of said 
3' terminal sequence element, and mixtures thereof, or a specific binding partner thereto 
is administered to modulate the course of therapy where interferon is being co- 
administered with one or more additional therapeutic agents. 

29. A pharmaceutical composition for the treatment of cellular debilitation, 
derangement and/or dysfunction in mammals, comprising: 

A. a therapeutically effective amount of a material selected from the 
group consisting of the 3' terminal sequence element of Claim 1, its complement or a 
fragment thereof, an agent capable of promoting the production and/or activity of said 
3' terminal sequence element, an agent capable of mimicking the activity of said 3' 
terminal sequence element, an agent capable of inhibiting the activity of said 3' terminal 
sequence element, and mixtures thereof, or a specific binding partner thereto; and 

B. a pharmaceutical^ acceptable carrier. 
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1 30. An attenuated virus comprising the 3' terminal sequence element of Claim 1 in 

2 combination with a sequence which is substantially homologous to that of HCV-1, HC- 

3 Jl„ HC-J, HCV-BK, HCV-H, HC-J6, HC-J8, HC-J483, HC-J491, HC-C2, HCV-JK, HCV- 

4 N, HCV-T, HCV-JT, HC-G9, HCV-K3a, NZL1, or HCV-Tr, which virus is replication- 

5 impaired. 

1 31. An antisense nucleic acid against a viral genome RNA or its complement 

2 comprising a nucleic acid sequence hybridizing to one of the nucleic acid molecules of 

3 SEQ ID NOS:1-12, 20-24, 28-31 and 33-36 or their complements. 

1 32. The antisense nucleic acid of claim 31 which comprises an oligonucleotide 

2 modified to enhance stability, uptake or targeted intracellular degradation. 

1 33. The antisense nucleic acid of Claim 31 which is RNA. 

1 34. The antisense nucleic acid of Claim 31 which is DNA. 

1 35. A recombinant DNA or RNA molecule having a DNA or RNA sequence which, on 

2 transcription, produces an antisense ribonucleic acid against a viral genome RNA or its 

3 complement, said antisense ribonucleic acid comprising a nucleic acid sequence which 

4 hybridizes to one of the nucleic acid molecules of SEQ ID N0S:1-4, 20-24, 28-31 and 

5 33-36 or their complements. 

1 36. A virus-producing cell line transfected with the recombinant DNA or RNA 

2 molecule of Claim 35. 

1 37. A ribozyme that cleaves the viral mRNA of Claim 35. 

1 38. The ribozyme of Claim 37 which is a Tetrahymena-type ribozyme. 

1 39. The ribozyme of Claim 37 which is a Hammerhead-type ribozyme. 



1 
2 



40. A recombinant DNA molecule having a DNA sequence which, upon transcription, 
produces the ribozyme of Claim 37. 
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1 41 . A virus-expressing cell line transfected with a recombinant DNA or RNA 

2 molecule comprising a replication competent HCV RNA comprising the 3' terminal 

3 sequence element of claim 1 . 



1 42. A virus-producing cell line transfected with a recombinant DNA or RNA molecule 

2 comprising the 3' terminal sequence element of Claim 1 in combination with a sequence 

3 which is substantially homologous to that of HCV-1, HC-J1 , HC-J, HCV-BK, HCV-H, 

4 HC-J6, HC-J8, HC-J483, HC-J491, HC-C2, HCV-JK, HCV-N, HCV-T, HCV-JT, HC-G9, 

5 HCV-K3a, N2L1 , or HCV-Tr. 



PCT/US96/14033 



1/10 



FIG.I 



Genomic RNA Oligo A 

3' 5' 3' 

-OH (g > » 



1 



RNA-ligase 



RNA-oligoA 




5' 



Reverse transcription 
with oligo B 



RNA-oligoA 



3' 



cDNA 



1 



oligo B 



PCR with 
oligo B* oligo C 



— ih 



oligo C 



oligo B 



dsDNA 



I Cloning of the PCR- fragment 
t and sequence-analysis 



SUBSTITUTE SHEET (RULE 26) 



WO 97/08310 



PCT/US96/14033 



2/10 



CN] 

6 
LL 





c 




rd 








XJ 




O 




O 




Cn 








I 


o 


•• o 


o 


.. o 


XJ 




u 


•• u 


rd 




o 


.. u 


o 


rd 


rd 


(d 




rd 


XJ 




o 


Cn 


cn 


.. cn 




cn 


cn 


cn 


XJ 


•• XJ 






rd 


Cn 


Cn 


.. en 


rd 


• • rd 


< 


< 






H < 





c 

XJ 
XJ 

Cn 
j_> 

o 
o 

XJ 
4J 
XJ 
(0 
U 
U 

cn 
Cn 
rd 

4-> 

xj 
o 

' XJ 

o 

• o 

Cn 
Cn 
I 

U 
O 
XJ 
O 

rd 
o 

rd 
»TJ 
(0 
XJ 

Cn 

Cn 

Cn 

Cn 

XJ 

XJ 

Cn 

Cn 

rd 

< 



C 
XJ 



Cn 

XJ 

Cn 

XJ 

O 

XJ 
XJ 
XJ 

• XJ 

• <TJ 

• O 

• O 

• Cn 

• Cn 

• rd 

• XJ 

• XJ 

• u 

• XJ 

• o 

• o 
■ Cn 
. cn 
' I 

U 

u 

XJ 
O 
rd 

o 
(d 
«d 
rd 

XJ 

Cn 
Cn 
Cn 
Cn 

XJ 
XJ 

Cn 
Cn 
(d 

% 



C C 
XJ XJ 



O 
U 
O 
O 
XJ 

<d 
u 
o 



Cn 

XJ 

O 
O 

XJ 
05 
O 
O 



Cn cn 
Cn Cn 
(d rd 

XJ XJ 

I 

rd 
rd 
O 
O 



l 

<d 
<d 
o 
_ o 
Cn cn 
Cn cn 
rd (0 



c 

XJ 

cn *c 

XJ xj 

xj *— 

xj O 

xj O 

O O 

0 XJ 

xj cn 

XJ XJ 

xJ u 

rd rd 

U <d 

U xj 

Cn (J 

Cn cn 

rd rd 

XJ XJ 

1 I 
rd I 
rd rd 
O O 
O O 



C — 
4-> C 
XJ 

— Cn ~ 
C xj Cn 

XJ XJ XJ 

" — xJ Cn 

Cn xj xj 

u xj o 

U xJ O 

xj U O 

xj xJ rd 

4-> XJ 

Cn xj 

rd i 

xj i 

0 I 
Cn i 
rd I 
xJ I 

1 I 
I I 



O 
O 
XJ 
O 
rd 
O 
rd 
rd 
<0 

XJ 

ro 



U 
XJ 
U 
rd 
U 
<0 
fd 
fd 

XJ 

u 



Cn (J 
rd rd 



Cn cn 
rd m 
Cn cn 
Cn cn 
Cn cn 
Cn cn 



O 
O 

XJ 

O 
rd 
O 
rd 
rd 
rd 

XJ 

O 



O 
rd 
*J 
O 
Cn 
rd 

XJ 
XJ 

i 

O 
<d 



O 
U 



O 
rd 



O 
rd 



Cn O 
*d rd 
Cn U 
Cn Cn 
Cn Cn 
Cn U 
U Cn 



< < 



rd 
< 



rd 
< 



rd 
O 
O 
xj 
o 

rd rd rd 

O o U 
rd U 

XJ XJ _ 

O O O 

Cn rd rd 

rd O O 

xJ rd rd 

O rd rd 

O XJ xJ 

O rd xJ 
rd cn Cn 

rd rd cn 

rd rd rd 

U xj xJ 
Cn Cn Cn 
Cn cn cn 

U XJ XJ 

Cn u O 

rd Cn cn 

O < < 

< C3 O 

H H H 







2£ 




Id 




T-l 


t— t 








1 


X 


«H 


X 


1 


1 


1 


> 


> 


> 


u 


CJ 


CJ 


X 


X 


X 



rd 

f0 X) J3 — rd J3 

H ^ S oo N M 

^ CQ h 2 b b 



Jill 
> > > > 



t 

> 



2 H 



uuuuuuuu 
xxxxxxxx 



SUBSTITUTE SHEET (RULE 26) 



WO 97/08310 



PCT/US96/14033 



3/10 

FIG. 3 



H7 7 - # 1 . 2 GTGTCTCATGCCCGGCCCCGCTGGTTCTGGTTTTGCCTACTCCTGCTCAC 

H7 7 - # 8 GTGTCTCATGCCCGGCCCCGCTGGTTCTGGTTTTGCCTACTCCTGCTCGC 

H 7 7 - # 1 0 GTGTCTCATGCCCGGCCCCGCTGGTTCTGGTTTTGCCTACTCCTGCTCGC 

H77- #7 4 GTGTCTCATGCCCGGCCCCGCTGGTTCTGGTTTTGCCTACTCCTGCTCGC 

H7 7 - # 5 GTGTCTCATGCCCGGCCCCGCTGGTTCTGGTTTTGCCTACTCCTGCTTGC 

H77- # 1 . 2 TGCAGGGGTAGGTATCTACCTCCTCCCCAACCGATGAAGGTTGGGGTAAA 

H 7 7 - # 8 TGCAGGGGTAGGCATCTACCTCCTCCCC AACCG ATGAAGGTTGGGGT AAA 

H77- #1 0 TGCAGGGGTAGGCATCTACCTCCTCCCC AACCG ATGAAGGTTGGGGT AAA 

H77 - # 7 4 TGCAGGGGTAGGQATCTACCTCCTCCCCAACCGATGAAGGTTGGGGTAAA 

H77-#5 TGCAGGGGTAGGCATCTACCTCCTCCCCAACCGATGAAGGTTGGGGTAAA 

H77- # 1 . 2 CACTCCGGCCTCTTAGGCCATTTCCTGTTTTTTTTTTTTTTTTTTTT . . . 

H77-#8 CACTCCGGCCTCTTAGGCCATTTCCTGTTTTTTTTTTTTTTTTTTTTTTT 

H7 7 - # 1 0 CACTCCGGCCTCTTAGGCCATTTCCTGTTTTTTTTTTTTTTTTTTTTTTT 

H77-#7 4 CACTCCGGCCTCTTAGGCCATTTCCTGTTTTTTTTTTTTTTCCCTTTTTT 

H7 7 - # 5 CACTCCGGCCTCTTAGGCCATTTCCTGTTTTTTTTTTTTTTTTTTTTjTTT 

H77-#1.2 

H77-#8 TTTTTTTTTTT 

H77- # 1 0 TTTTTTTTTTTTTTTTTTTT1 TTTTTTTCTTTTTTTTTTTTTTTTTCCTT 

H77- # 7 4 TTI TTT TT1 TTT rTTTTTTTl TT1 IT1 TTTT 

H77- #5 TTT TTT TTTTTTTTTTTTTTTTTT TTT TTTT 

H7 7-#1.2 CTTTCCTTCTTTTTT . . CCTTTCTTTTCCTTC 

H77- # 8 , CTTTTCCTTCTTTTTC . . CCTTTTTCTTTCTTC 

H77-#10 TTTTTTTTTTTTTTTTTTCTTTCCTTCTTTTTT . .CCTTTCTTTTCCTTC 

H77-874 CTTTCCTTCTTTTTTTTCCTTTCTTTTCCTTC 

H77- # 5 CTTTCCTTCCTTTTCC . CTTTTCTTTTC . TTC 

H77- # 1 . 2 CTTCTTTAATGGTGGCTCCATCTTAGCCCTAGTCACGGCTAGCTGTGAAA 

H77-#8 CTTCTTTAATGGTGGCTCCATCTTAGCCCTAGTCACGGCTAGCTGTGAAA 

H77- # 1 0 CTTCTTTAATGGTGGCTCCATCTTAGCCCTAGTCACGGCTAGCTGTGAAA 

H77- #74 CTTCTTTAATGGTGGCTCCATCTTAGCCCTAGTCACGGCTAGCTGTGAAA 

H7 7 - ft 5 CTTCTTTAATGGTGGCTCCATCTTAGCCCTAGTCACGGCTAGCTGT 

H77- # 1 . 2 GGTCCGTGAGCCGCATGACTGCAGAGAGTGCTGATACTGGCCTCTCTGCA 

H7 7 - # 8 GGTCCGTGAGCCGCATGACTGCAGAGAGTGCTGATACTGGCCTCTCTGCA 

H77 - # 10 GGTCCGTGAGCGGCATGACTGCAGAGAGTGCTGATACTGGCCTCTCCGC£ 

H77 - #74 GGTCCGTGAGCfiGCATGACTGCAGAGAGTGCTGATACTGGCCTCTCTGCT 
H77-#5 

H7 7 - # 1 . 2 GATCATGT 

H77-#8 GATCATGT 

H77-#10 GATCATGT 

H77-#74 GATCATGT 

H77-#5 
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FIG. 6 

1 50 
lb CTGTCTCGTG CCCGACCCCG CTGGTTCATG TTGTGCCTAC TCCTACTTTC 
3 GTGTCACGTG CCCGAACCCG CTATTTGCTG CTTTGCCTAC TCCTACTAAC 
4? GTGTCCCATG CCCGACCCCG CTATCTACTC CTGTGCCTAC TCCTACTTTC 
4a ATGTCTCATG CCCGACCCCG CTATTTACTC CTGTGCCTAC TCCTACTTAC 

^ 1 100 
lb CGTAGGGGTA GGCATCTACC TGCTCCCCAA CCGATGAA. . CGGGGAGCTA 
3 GGTAGGGGTA GGCATCTTTC TCCTGCCAGC GCGATGAGCT GGTAGG ATA 
4? CGTAGGGGTA GGCATCTTCC TGCTGCCTGC TCGATAGGCA GCT * TA 
4a AGTAGGGGTA GGCATCTTCC TGCTGCCTGC TCGGTAGGCA GCT TA 



101 150 
lb ACACTCCAGG CCAATAGG.C AT. .CCTGTT TTTTTTTTTT TTTTTTTTTT 

3 ACACT CC ATTTCTTTTT TTGTTTTTTT TTTTTTTTTT 

4 ? ACACTCCGAC CT . . TAGGGT CCTTC . TGTT TTTTTTTTTT TTTTTTTTTT 
4 a ACACTCCGAC CT . . TAGGGT CCCCT . TGTT TTTTTTTTTT 

151 200 

lb TTTTTTTTTT TTTTTTTTTT TTTTTTTTTT TTTTTTTTTT TT 

3 TTTTTTTTTT TTTTTTTTTT TTTTTTTTTT TTTTTTTTTT TTTTTTTTTT 
4? TTTTTTTTTT TTTTTTTTTT TTTTTTTTTT TTTTTTTTTT TTTTTTTTTT 
4a 



•••••••••» .......... .......... .......... CTTTT(*TTT 

3 ...CTTTTTC TTTCCTTTCT TTTCTGACTT CTAATt!!tT CCTTCTT..A 

4 ? TTTTTTCCTT ACCCTTTCCT TCTTTTCTTC CTTTTTTTTC CTTACTTT 
4a C TTTCCTTCTT T...CCTTTC CTAAT . CTTT CTTTCTT. '. 



251 

lb GGTGGCTCCA TCTTAGCCCT AGTCACGGCT A 
3 GGTGGCTCCA TCTTAGCCCT AGTCACGGCT A 
4? GGTGGCTCCA TCTTAGCCCT AGTCACGGCT A 
4a GGTGGCTCCA TCTTAGCCCT AGTCACGGCT A 
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FIG. 9 
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